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CHAPTER  I 


V 


where  L  ia  the  non-singular  Hermitian  variance- covariance  matrix  and  £*' 
represents  the  transpose  of  the  conjugate  of  £  •  The  mean  of  the  vector 
£  is  assumed  to  be  0  .  Wooding  (1956)  also  derived  the  characteristic 
function  for  the  above  distribution  and  found  it  to  be  given  by 


where  T  is  some  arbi*  ^ry  complex  parameter. 

It  appears  that  N.  R.  Goodman  (1963),  working  independently  and 
using  the  same  assunptions  as  given  in  (1.1.0),  has  also  derived  the  conplex 
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multivariate  normal  distribution.  He  extended  the  development  further 
by  deriving  the  complex  analogue  for  the  real  Wishart  distribution,  de¬ 
noting  it  as  the  complex  Wishart  distribution.  Also,  ''Goodman  derived 
the  distributions  of  the  complex  analogues  for  the  sample  multiple  ai.d 
sample  partial  correlation  coefficients.  In  the  complex  case,  these  are 
denoted  as  the  sample  multiple  coherence  and  the  sample  partial  coherence. 

All  developments  that  have  followed  in  this  area  of  complex  multi¬ 
variate  analysis  have  been  based  on  the  assumption  of  the  density  function 
specified  by  Wooding  (1956)  or  Goodman  (1963). 

Other  authors  who  made  early  contributions  in  this  area  of  complex 
multivariate  analysis  have  been  M.  S.  Srivastava  (1965),  N.  Giri  (1965), 

D.  G.  Kaibe  (1966b)  ,  and  C.  G.  Khatri  (1965).  D.  G.  Kabe  (1965®)  gave  a 
simplified  method  of  deriving  the  distribution  of  the  sample  multiple 
coherence  and  the  sample  partial  coherence.  He  represented  the  density 
functions  in  a  finite  series  while  Goodman  (1963)  had  presented  these  as 
infinite  series.  Kabe  (1966b)  developed  the  complex  analogue  some  results 
in  classical  multivariate  normal  regression  theory.  Srivastava  (1965) 
gives  a  direct  and  simplified  method  of  deriving  the  Wishart  distribution. 
Khatri 's  (1965)  contributions  were  in  many  areas.  He  obtained  the  maxi¬ 
mum  likelihood  estimates  of  the  parameters  of  the  complex  multivariate 
normal  given  by  Wooding  (1956)  and  Goodman  (1963);  he  then  derived  the 
distributions  of  these  estimates.  Also  included  in  his  work  are  pro¬ 
cedures  for  transforming  from  densities  of  complex  random  variables  to 
other  complex  random  variables.  Using  these  techniques,  Khatri  (1965) 
derived  the  distributions  of  the  sample  multiple  coherence. 

Khatri  (1965)  also  derives  the  distribution  of  Wilks'  A  in  the 
complex  case.  He  gives  a  representation  of  Wilks'  A  (in  the  complex 
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case)  as  the  product  of  real  independent  beta  variables.  It  is  this 
development  that  proves  useful  to  the  work  given  in  this  dissertation. 

Through  the  work  of  the  above  authors  the  foundation  for  complex 
multivariate  analysis/  based  on  the  assumption  of  a  complex  multivariate 
density  as  given  in  (1.1.1)/  has  been  established.  It  is  the  purpose  of 
this  dissertation  to  extend  this  development  by  examining  Milks'  A  which 
appears  as  a  solution  to  some  inference  problems  associated  with  this 
particular  complex  multivariate  normal  distribution. 

This  extension  is  covered  in  Chapter  III  by  an  examination  of 
Wilks'  A  as  presented  in  the  multivariate  analysis  of  variance  of  com¬ 
plex  normal  data,  the  regression  analysis  of  conplex  normal  variables 
upon  real  dummy  variables/  and  discrimination  among  several  complex 
normal  populations  with  the  same  variance-covariance  matrix  but  with 
different  means.  Also  the  goodness  of  fit  of  a  hypothetical  discriminant 
function  is  considered. 

In  Chapter  II/  some  basic  theorems  dealing  with  the  complex  multi¬ 
variate  normal  are  established  along  with  the  Bartlett  decomposition  of 
a  complex  Wish  art  matrix.  These  results  are  used  in  establishing  some 
different  expressions  for  the  sample  coherence  and  sample  multiple 
coherence.  These  results  prove  useful  in  working  with  Wilks'  A  in 
Chapter  III.  Chapter  IV  presents  a  summary  of  the  work  covered  in  this 
dissertation  along  with  possible  extensions  of  this  research  into  other 
areas. 

2.  Definitions  and  Notation 

Let  S  denote  a  d  *  1  complex  random  vector. 


C' 


tV  V 


(1.2.0) 
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where  £_,  =  +  iy..  .  Denote  the  (2p  *  1}  vector  of  real  and  imaginary 

components  by 

s'  ■  tv*i«  W  ■"  '  VVixJp 

and  let  this  random  vector  have  a  2p-variate  normal  distribution  with 


6'  =  E(n')  =  p_,v  ,  •••  ,  \i  ,v  ] 

*.  JL  2.  A  P  P 


and  the  variance-covariance  matrix  given  by 


=  E[(n  -  5)  fn  -  6)  •] 


where  the  2x2  submatrices  of  have  the  following  special  form 


E  (x_.  -  Eix.)  ]  [Xj^  -  Efx^)  ]  E  [Xj  -  E(Xj)  ]  [y^  -  Efy^)  ] 

E [y j  -  E(y.)][^  -  E^)]  E [y j  ~  E(y.)][yk  -  E(yk)] 


!  2 

r. 

1  o. 

1  0 

\  JL 

\  * 

_°  1 

if  i  =  k 


O  .0, 


a.,  -  6., 

3k  jk 

B  “  a 
.  lk  lk 


if  j  ¥  k. 


(1.2.2) 


Then  the  density  function  of  n  is  given  by 


P  (n ) 


(211 )  E 


p*  ,1/Jexpr  2(n  -  f-rEn1(2  -  f-> 


(1.2.2) 


provided  that  |E^|  ^0  . 
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It  has  been  shown  by  dooding  (1956)  and  Goodman  (1963)  that  if 
condition  (1.2.2)  is  satisfied  for  then  £  is  distributed  as  a  p- variate 
complex  normal  vector  with  probability  density  function  given  by 


P(£)  -  0IP|e  I)'1  exp[-(£  -  jj)*'!’1^  -  y)]  (1.2.4) 

where  the  mean  p  is  given  as 

p'  -  El5')  ■  [p.+iv  ,  p.+iv.,  •*•  ,  P  +iv  ] 

-  -  1122  pp 

and  the  variance -covariance  structure  is  given  by  the  Hermitian  positive 
definite  matrvx  ,  where 


E e  “  E((§  -  P)(§  -  p)*’l 


and  the  elements  of  are  given  as 

(  °k  if  J  *  k 
0jk  "  ) 

({°jk  +  AV  j*  if 


The  density  function  (1.2.4)  will  be  denoted  as  CN^(p,  E)  . 

Let  F(p  denote  the  distribution  function  of  the  complex  random 
vector  £  with  density  given  as  C*Mp,  E)  .  By  the  probability  that  £  is 
less  than  ,  the  following  is  meant. 

■’%>  -  PIS  <  S0'  '  p(5r  <  Sor  ’  W  a-2-5’ 

where  §R  denotes  the  reel  pert  of  §  and  £x  denotes  the  real  random  variable 


attached  to  the  imaginary  quantity. 


6 


Let  •••  ,  £  denote  n  independent  and  identically  distri- 

-1  -2  -n 

bitted  p-variate  complex  normal  vectors  with  density  given  by  (p ,  1}  . 
Then  the  random  matrix  S,  defined  by 


n 

s  =  l  (5  -  l)  <5  -  5)** 

i=l 

where 


has  been  shewn  by  Goodman  (1963)  and  Khatri  (1965)  to  be  distributed  as 
a  complex  Wishart  with  probability  density  function  given  by 


P(S)  -  [rp(n)f  1|l|"n|s|n~1"P  exp [-  tr  L^S] 


where 

yP(p-l) 

r  (n)  =  TT 
p 

The  conplex  Wishart  density  function  with  parameter  l  and  degrees  of 

freedom  n-1  will  be  denoted  as  CW  (sln-lji'  , 

P 

Let  where  £ '  =  [£. ,  C_ ,  *  •  •  ,  £  ] ,  be  distributed  as  a  CN  (i  ,  l)  . 

1  2  p  p  - 

As  in  real  multivariate  analysis,  the  relationship  between  the  component 
£  and  a  linear  combination  of  5.,  L,  ,  C  ,  has  a  very  important 

p  12  p-1 

role  in  complex  multivariate  analysis.  In  the  real  case  this  relation¬ 
ship  is  measured  by  the  multiple  correlation  coefficient  which  is  defined 
as  the  maximum  correlation  between  cne  component  and  a  linear  combination 
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of  the  others.  In  complex  multivariate  analysis,  Goodman  (1963)  has 
termed  this  quantity  as  the  multiple  coherence  and  shown  it  to  be  given 
by 


V2  I12E113:12  E21E11E12 

yv  v  *”  '  w  =  =  “ — 

pp  pp 


(1.2.6) 


where  E  ,  E0^  are  obtained  from  the  variance-covariance  matrix  E 

which  is  partitioned  as 


i 

! Ei2 

!  E_, 

'  a 

j_  21 

'  PP  J. 

.atri 

(1955!  h 

2 

•V 

». 

\ 

r~ 

1 

ir  '*2* 

Goodman  -1963)  and  Xhatri  (1955!  have  given  the  distribution  of 


p-r 


„,f2,  (n)  ,,  „2%n#'2.p-2,,  n-p„ ,  ,  „2~2, 

f(R  )  =  — t~ - rr  (1-P  )  (R  )  (1-R  )  *F(n,  n  ;  p-1,  P  R  ) 

i  i.)*  (n  p  / 


where  P  denotes  (1.2.6),  the  peculation  multiple  coherence  of  Ep  with 

~2 

E,,  E_,  •••  ,  E  •  R  denotes  the  sample  estimate  and  F(  ,  ;  ;  5  denotes 

12  p 


the  hypergeometric  function,  i.e.. 


F(n,  n  ;  p-1,  P2R2)  =  J 


r  (n+j)  .  r(n+j)  .  r  (p-d  #  (p2r2)3 
r(n)  ‘  r(b)  ’  r(p-i+j)  '  j  i 


CHAPTER  II 


I.  Basic  Results 

There  is  a  great  deal  of  similarity  between  the  multivariate  analysis 
of  complex  random  variables  and  real  random  variables.  Despite  this 
similarity,  it  seems  worthwhile  to  include  the  following  results  with 
formal  proofs  in  order  to  provide  a  firm  foundation  for  the  material 
presented  in  latter  chapters. 

In  the  case  of  real  normally  distributed  random  variables,  it  is 
of  great  importance  to  establish  the  distribution  of  the  square  of  the 
random  variable.  The  analogy  of  this  in  the  con?) lex  case  is  the  complex 


randor,  variable  time?  ita  conjugate. 


A  rat  ff\ 


1  * 


As  pointed  out 


in  Chapter  I,  Z  is  composed  of  two  real,  independently  distributed  normal 

2 

variables  with  y  »  0,  and  c  «=  1/2  .  Consider  the  real  random  variable 

2  2 

defined  by  y  =  ZZ*  =  x  +  y  which  is  the  sum  of  squares  of  two  indepen¬ 
dent  N(0,  1/2)  random  variables.  From  real  univariate  theory,  it  is 
clear  that 


y 


X 

ZZ*  “V  - 


2 

ill 

2 


(2.1.0) 


2 

where  stands  for  a  chi-square  random  variable  with  f  degrees  of 

freedom. 

Let  denote  a  complex  random  vector  with  probability  density 

function  given  by 


P(§')  -  <nP|E| )‘1  exp [-  <£  -  y)*'E"1(§  -  y)]  . 


e 
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Consider  the  transformation  given  by 

Y  m  a  i 

-r*l  r*p  -p*l 

where  A  is  a  real  or  complex  matrix  of  rank  r  <  p.  If  r  ■  p,  then  the 
probability  density  function  of  the  random  vector  Y  is  given  by 

P(Y')  -  (nPi E I )_1  expt-  (A-1Y  -  y)*,E"1(A-1Y  -  y) ]  •  |j| 

where  J (x  -*■  y)  is  given  by  Khatri  (1965) 

j-W'V'f1  • 

Thus  the  density  function  of  Y  becomes 

P(Y*)  =  (nP|AEA*'|)_1exp(-  (Y  -  Ay) *' IAEA* ' )  1 (Y  -  Ay)]  . 

This  result  is  stated  formally  as  the  following  theorem. 

Theorem  2.1.  If  £  is  distributed  accordingly  to  CNp(y,  E) ,  then  Y  »  A£ 
is  distributed  accordingly  to  CNp(Ay ,  AEA*’) ,  where  A  is  a  p  x  p  real  or 
coop  lex  matrix  of  rank  p. 

Now  consider  the  transformation 

-rxl  ”  Arxp  ^pxl 

where  the  matrix  A  is  of  rank  r  <  p.  The  mean  of  Y  is  given  by 

E (Y)  »  E(AC)  =  AE(£)  «  Ay 
and  the  covarian'-e  is  given  by 

E  [Y  -  E(Y)][Y  -  E  (Y)  *'  ]  =  AE  (£  -  y)  (5  -  y)*’A*’ 


=  AEA*’ 
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Since  A  is  of  rank  r,  it  is  known  that  there  exists  a  matrix 
rxp 

C,  .  such  that  the  transformation 
(p-r) xp 

fa 

\-(p-r)xl/  V‘/ 

is  a  non-singular  transformation.  Then  Y  and  Z  will  have  a  joint  c can- 

pie  x  normal  distribution  and  the  marginal  distribution  of  Y  will  be 

CN^fAy,  AEA*').  This  result  will  be  stated  as  the  follcwing  theorem. 

Theorem  2.2.  If  £  is  distributed  accordingly  to  CN^(y,  E) ,  then  Y  =  A  £ 

is  distributed  accordingly  to  CN^CAy,  AEA* ') ,  where  A  is  a  r  x  p  matrix 

of  real  or  complex  elements  of  rank  r  <  p. 

Now  let  K  ,  ^  CB  (0,  I  ),  and  make  the  transformation 
-pxl  p  -  pxp 

-pxl  pxp  -pxl 

where  U  is  a  unitary  matrix,  i.e.,  0*'U  =  UU*'  =  I  .  Then  by  Theorem  2.1, 

P 

Y  ,  *v>  CN  (0,  I  ).  It  should  be  noted  that  the  elements  of  U  must  be 
-pxl  p  -  p 

constants.  If  the  elements  are  random  variables,  the  transformation  is 
a  random  unitary  transformation  and  the  distribution  of  Y  is  not  necessarily 
that  of  a  CM  (0,  I  ).  In  either  case 

p  -  p 

Y*'Y  -  §*'U*'U  5  =  £*'§  .  (2.1.1) 

Thus,  §*'§  remains  invariant  under  a  unitary  transformation.  Now  consider 
the  transformation 


-rxl 


A  5  . 

rxp  -pxl 


(2.1.2) 


where  the  matrix  A  is  of  rank  r  <  p  and  has  the  additional  property 
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A  A*’  =>  I  .  This  transformation  will  be  referred  to  as  a  semi-unitary 

rxp  pxr  r 

transformation.  As  in  Theorem  2.2,  it  is  known  that  the  matrix  A  may 
be  augmented  with  a  matrix  B  such  that  the  transformation 


-rxl 


^-(F-r)xl, 


is  of  full  rank  and  furthermore  it  is  unitary.  Thus 


'  Y  . 
-rxl 


(p-r)xly 


K  CN  (0,  I  ) 
P  -  P 


and  from  the  invariance  property  (2.1.1) 


«;{i  *  5552  +  •••  +  -  viyi  +  ?iy2  +  • 


+  Y*Y  +  Z*  + 

r  r  r+1  r+1 


+  2*Z 
P  P 


and  the  variables  Y2,  •••  ,  Yf  ,  Z^,  Z^,  •••  ,  Zp  are  distributed 

as  independent  0^(0,  1)  as  are  the  Cj*  £2>  ***  >  £  • 

From  (2.1.1)  we  have 


Y*'Y  =  Y*Y  +  Y*Y_  +  •••  +  Y*Y 
-  -  11  22  rr 


and  since  Y^,  Y2#  •••  ,  Yr  for  an  independent  set  of  CN^O,  1),  then  from 

(2.1.0) 


ViYl  +  V2J2  *  •"  +  ',?r'  2 


2r 


Also 


2 

X~ 

_±£ 


Hi  zH  pp  2 
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Then 


Ei{i  +  *  " 


*  5;s  - «?! + 


•  +  Y*Y  )  'u  2  ~  — 

r  r  2 


Since 


/ 


z*  ,z  ,  + 
r+l*r+l 


+  Z*Z  =  (5*5,  +  •••  +  5*C  )  -  (Y*Y  +  ••*  +  Y*Y  )  . 

pp  11  pp  11  rr 


This  result  is  the  complex  analogue  of  Fisher's  Lemma,  and  is  formally 
stated  as  the  following  theorem. 

XZ 

Theorem  2.:.  If  5  ,  ^  CN  (0,1),  then  5*5'  ^  -=?-  and  if  Y  .  ■  A5  . 

- - -  -pxl  p  -  P  -  -  2  2  -rxl 


where  A  is  a  semi-unitary  transformation,  then  Y*'Y  ^  „ 

v2  -  -  2 


2r 


Moreover, 


Z*'Z  »  5*'5  -  Y*'Y  •u 


21b=*L 


and  is  independent  of  Y*‘Y  . 


Let  X  =  [§  ,  £  ,  •••  ,  5  1  where  the  5.  ^  Independent  CN  (0,  E) 
j,  *  **n  -l  p  “ 

and  consider  matrix  of  Hermitian  forms,  XAX*’  where  A  is  a  real  idempotent 
matrix  of  rank  r.  The  distribution  of  XAX*'  needs  to  be  established. 

Since  A  is  a  symmetric  matrix,  the  spectral  decomposition  of  A  may  be 
used,  A  may  be  written  as 


P 

I 

i=l 


x .  e .  0 : 


where  the  X^  are  the  characteristic  roots  of  A,  and  Lhe  are  the 
corresponding  orthogonal  characteristic  vectors  of  A.  since  A  is 
idempotent  of  rank  r,  then  X^  =  1,  for  i  =  1,  •••  ,  r  and  X_.  =  0  for 
j  ■  r  +  1,  •••  ,  p,  ths  the  spectral  decomposition  of  A  is  given  by 


*-  l  Mi 

i=l 


and 
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r  r 

XAX* 1  »x  [  B.B.'x*'  »  y  XB.B.'X*' 

,  ,  -i-i  .**„  -i-i 

i=>l  i=<l 

Since  £,  ^  CN  (0,  E),  then  the  distribution  of  the  matrix  X  is  given 
-i  p  -  pxn 

by  CN^^n(0,  I  ©  E)  where  8  stands  for  the  Kronecker  product.  Making  the 

transformation  Y,  «  X8.  ,  it  is  seen  that  Y.  ^  CN  (0,  BIB.  ©  E)  for 

— i  “1  -l  p  u  -i-i 

i  ■  1,  2,  •••  ,  r  which  is  the  same  as  Y.  ^  CN  ((?,  E)  since  BIB.  *  1 

-l  p  -  -i-i 

and  the  Y. ,  Y_,  •••  ,  Y  form  an  independent  set  of  CN  (0,  E)  since 
"l  -r  p  — 

BIBj  *  0,  thus 

r  r 

XAX* '  =  l  XB.SIX*'  =  l  Y.Y*'  ^  CW  (XAX*'|r|E)  . 

i=l  1  1  i=l  11  ^ 

This  result  will  be  stated  formally  as  the  following  theorem. 

Theorem  2.4.  X  =  fC,  ,£„,*•*,  £  ]  where  £.  are  distributed  as 

-  —i  -il  -1 

Independent  CN^(0,  E)  ,  then  for  A  a  real  indepotent  matrix  of  rank  r  <  p, 

the  matrix  of  Hermitian  forms  XAX*’  is  distributed  as  a  CW  (XAX*'|r|E  )  . 

P  P 

2.  Bartlett  Decomposition  of  a  Coup  lex  Wishart  Matrix 
Kshirsagar  (1959)  has  presented  an  elegant  method  of  deriving  the 
Bartlett  decomposition  of  a  real  Wishart  matrix,  i.e. ,  expressing  the 
Wishart  matrix  as  the  product  of  a  lower  triangular  matrix  times  its 
transpose  and  giving  the  distribution  of  the  elements  of  the  lower  tri¬ 
angular  matrix  by  using  random  orthogonal  transformations.  This  has  been 
achieved  in  the  case  of  a  complex  Wishart  matrix  by  following  the  above 
procedure.  The  minor  changes  that  are  encountered  are  pointed  out  at  the 
conclusion  of  this  section. 

Let  §!'  §2'  ***  '  §n  where  =  t5li'  £2i'  ***  '  5pi,f  denote 
n  independent  and  identically  distributed  p-variate  complex  normal  vectors 
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with  probability  density  function  given  by  CN^  (0,  1^) .  Denote  the  matrix 


of  the  vectors  by  Y,  i.e.,  X  =  [£^,  * 

Let  A  denote  the  matrix  defined  by 


'  -n^pxn 


a  =  xx*1  =  y  i .  s  *  • 
.  ,  -1-1 
1=1 


then  it  has  been  shown  [Khatri  (1965) ]  that  the  matrix  A  is  distributed 

as  CW  (a|  n|  I  ) . 

P  1  P 

Now  consider  the  vectors  defined  by 


5l  = 


"V 

?21 

?22 

• 

Cpi 

SP2 

• 

’ll 

5 12 

« 

!  Z„  = 

•  •  •  •  •  *7  -- 

»  r  A  — 

• 

-2 

• 

-p 

• 

• 

• 

• 

}ln 

l  P»lJ 

(2.2.0) 


which  are  the  vectors  of  observations  on  each  of  the  p-variates.  Each 

of  the  variables  is  independent  and  distributed  as  CN^(0,  1). 

Orthogonalize  these  vectors,  Z  ,  •••  ,  Z  ,  by  using  the  Gram-Schmitt 

i  P 

process  such  that  the  new  vectors  form  an  orthonormal  basis.  The  new 
vectors,  Y^,  Y^,  **•  ,  Y,  may  be  represented  as 


*,  -  bn5l 

-2  '  h21h  * 


y  ~  ^^7  4-  +  •  •  •  +  j^PPz 

-P  *-2  "P 
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ii 

The  b  for  i  »  1,  2,  •••  ,  p  can  be  chosen  to  be  real  while  the  b  J, 
k  >  j,  k  «■  1,  2,  •••  ,  p  j  ■  1,  2,  •**  ,  p  will  in  general  be  complex. 

To  see  this  consider  [Hahn  (1958) ] 


and  since  b11  is  to  be  chosen  such  that  Y*'Y^ 
which  is  real.  Now 


1,  this  implies  b^  = 


21  22 
Y  =  b  Z,  +  b  Z„ 
-2  -1  -2 


and  with  the  condition/  Y*'Y2  =  0,  this  implies 

*  Z  +  b1Xb22  •  Z*’Z  •=  0  . 

-1  -1  -1  -2 

21  22 

This  is  a  homogeneous  equation  in  two  unknowns,  b  '  and  b  .  A  complete 


solution  is  given  by 


b21  =  -  (Z* *  •  Z2)t 
b22  -  (Z* 1  •  Zx)t 


where  t  is  an  arbitrary  parameter.  Continuing  in  this  fashion,  it  can 
33 

be  shown  that  b  is  proportional  to  (Z* ’Z^) (Z* ’Z,) .  So  if  t  is  chosen 

22  33  ii 

to  be  real,  we  have  b  real,  also  b  real,  and  all  other  b  will  be 

real.  Attention  should  be  called  to  b  ,  k  >  j  .  For  a  real  t,  this 

)c  1 

does  not  imply  that  b  J  will  be  real.  Thus  the  b^,  i  *  1,  2,  ••*  ,  p, 
will  be  restricted  to  be  real. 

Then  the  above  system  may  be  written  as 


16 


Since 


Y  ®  B  X 
pxn  pxp  pxn 


pxn 


Let  B  ”  [b^]  ’  Since  the  diagonal  elements  of  B  are  real,  then  the 
diagonal  elements  of  B  1  =  B  must  necessarily  be  real.  Hence,  for 
i  "  1,  2,  •••  ,  p  are  real  elements  and  the  reciprocal  relation  may  be 
written  as 


or  as 


(2.2.2) 
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which  implies  that 


^k  "  bki-l 


+  b,  „Y„ 
k2-2 


+  bkk4 


(2.2.3) 


i.e.,  the  can  be  expressed  in  terns  of  the  unitary  vector*  y^, 
k  -  1,  •••  ,  p  . 


From  (2.2.2)  it  is  seen  that  the  matrix  X  can  be  represented  as 
the  matrix  B  times  the  semi -unitary  matrix  Y,  then 


XX*'  =  BYY*'B*'  =  BB*'  ,  since  YY*'  =  I 

P 

The  distribution  of  XX*'  is  CW  (Alnll  )  thus  BB*’  *  CW  (A|n|l  )  or  that 

P  1  P  P  P 

A  =  BB*'  .  Thus  a  complex  wishart  matrix  can  be  expressed  in  terms  of  a 
lower  triangular  matrix,  whose  diagonal  elements  are  real  and  the  non¬ 
zero  off  diagonal  elements  are  complex,  times  the  transpose  of  its  conjugate. 
Also  note  that 


-j'5*  b*j 


k  =  1,  **•  ,  p 
j  -  1,  2,  •••  ,  k 


(2.2,4) 


this  result  follows  from  (2.2.3). 


Now  keep  Z ,  Z2» 


Z^  fixed.  This  automatically  fixes 


h'  -2 '  "•  ■  4-1  '  bU’  •"  '  Vl.l  ■  b12- 


•  Vi,k-1  by  virtue 


of  relation  (2.2.4) .  Also  recall  that  the  are  distributed  independently 


as  01^(0,  1).  Consider,  from  (2.2.4),  the  b^fc ,  t  *  1,  2,  •••  ,  k-1,  which 


may  be  written  as 
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bki  -  n'\ 

bk2  -  'i-v\ 


(2.2.5) 


b.  .  ,  =■  Y*'  Z, 
k  k-1  -k-l-k 


and  by  holding  Z^,  Z^,  •••  ,  Z^_.  ii.xed,  this  becomes  a  semi-unitary 

"’kl '  bk2'  '  “k  k-1 


transformation  from  ?kl,  •••  to  ,  b,^,  •••  ,  b,. 


Observing  that 


‘V  -2' 


•  r>  •  •  •  *7  1 

'  ’  -V  '  V 

fixed  random 


automatically  requires  that 


ft 


or 


<11 

r’21 

•  •  • 

fjk-l 

1 

r 

’ki 

•  •  • 

r 

’Pi 

r 

"12 

?22 

•  •  • 

?k-l 

2  ; 

?k2 

•  •  • 

5P2 

• 

• 

• 

• 

• 

• 

. 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

i 

• 

• 

5  in 

2n 

• .  • 

Ck-1 

i 

n  i 

?kn 

•  •  • 

i 

5  1 

pn 

fixed 


random 


% 


A 


Thus 


b  .  =■  Y*’Z  for  i=1.2, 
K]  -j  -k 


,  k-1 


is  distributed  ai  a  linear  combination  of  the  £,  where  t  =  1,  2, 

kt 

and  the  are  istributed  as  independently  CT^fO,  1).  Thus 


•  ,  n 


b  .  ^  ON  (0,  11 

X  L 


(2.2. 6) 
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since  var(bk  J  *Y*'Y.j  =  1,  and  E(b^k)  “  0*  Furthermore,  the  b^t  forms 


an 


independent  set  for  t  ■  1,  2,  •••  ,  k-1.  Observe  that  for  t  j<  t1 


cov!v  bkf> -  “v<>rv  vj'v  -  *rv  -  ° 


Thus  (2.2.6)  is  the  conditional  distribution  of  the  b,  .  with 

kj 

Z1#  2^ ,  ,  Zk_^  being  fixed,  the  Z^s  do  not  enter  into  the  distri¬ 

bution,  so  (2.2.6)  is  also  the  unconditioned  distribution  of  the  b,  . ‘s, 

kj 

for  j  <  k  and  the  b  . 's  form  a  mutually  independent  set.  Hence,  the 
result 


,  b  .  'v  Independent  CN_  (0,1)  . 

K  K~X  jL 


(2.2.7) 


As  stated  previously,  the  transformation  (2.2.5)  is  a  semi-unitary  trans¬ 
formation  from  ?kl,  •••  ,  ?kn  to  bkl,  bk2,  ,  bk  k  .  This  trans¬ 
formation  can  be  completed  with  [n  -  (k-l) ]  new  random  variables  in  such 
a  way  as  to  maintain  independence  between  b^,  bk2,  ,  bfc  k  ,  and 
the  [n  -  (k-l)]  random  variables,  so  that  the  complete  transformation 
is  unitary.  Since  the  transformation  is  of  full  rank  and  unitary,  then 


^l?kl  +  ^2?k2  +  -  +  «Sn*kn  * 


bklbkl  +  ***  +  ^k  k-l^k  k-l  +  “  (k-l) ]  random  variables. 


Rewrite  the  anove  as 


5k'5k  “  bklbkl  +  "•  +  bk  k-l\  k-l  +  In  -  (k-ln  ra“<to"  variables ^ _ 2. Q) 


and  note  the  independence  among  the  variables  on  the  left  once  more . 

x/2 


CSlff  hiOVinA«>A  /  O  O  0\  4  o  ri  Kn4*n4  -»c*  a 

”UJkW»vIlSwi.«/  •  V/  XU  WUaOUX  XMUbVU  uu  u 


(2n) 


,  but  by  (2.2. 3)  ,  it 


is  seen  that 
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z*z.  =  b*  b  +  •••  +  b*  ,b.  .  ,  +  hr, 

-k  -k  kl  kl  k  k-1  k  k-1  kk 


Also,  by  virtue  of  (2.1.0),  it  is  seen 


2 

X2 (k-1) 


b*  b.  ,  +  +  b*  .  ,b,  ,  ,  ^ 

kl  kl  k  k-1  k  k-1  2 


Thus  by  Fisher's  Lemma,  it  is  seen  that 


b. ,  =  Z**Z,  -  (b*  b  ,  + 
kk  -k  -k  kl  kl 


+  b*  b  ) 
k  k-1  k  k-1' 


{2  [n  -  (k-1)  1 


2 

and  b  is  independent  of  b  ,  •••  ,  b  and  Z  ,  Z  ,  •••  ,  Z  . 

KK  Kl  K  K-1  “ X  ™4  -K-1 

This  result  js  true  for  every  k  =  1,  2,  •••  ,  p. 

In  b  rnimary,  it  is  seen  that  a  complex  Wishart  matrix  A,  may  be 

represented  as  the  product  of  a  lower  triangular  matrix,  B  ■*  [b  ] ,  where 


b^  ,  k  =  1,  2,  •••  p  ,  are 


.  v  >  -i  .  .. 

r  —  j  r  — 


e  con*" lex  tidies 


the  transpose  of  its  conjugate  where  the 


b^  ^  Independent  <3^(0,  1)  for  j  <  k,  k  -  1,  2,  •••  ,  p 


and 


b2  „  _2Jn_  ~  Ml 

kk  2 


These  results  are  similar  to  that  of  the  decomposition  of  a  real 

Wishart  matrix  in  that  the  b  ,  ^  Independent  N(0,  1)  for  j  <  k, 

*  J 

k  -  1.  2,  •••  ,  p  «rd  •v  X jn  .  (k.ln  • 


w 
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3.  Coherence  Between  Two  Complex  Random  Variables 


Ruben  (1966)  obtained,  in  the  case  of  real  bivariate  normal 
situation,  a  very  good  approximation  to  the  distribution  of  the  cor¬ 
relation  coefficient.  This  was  done  by  expressing  the  sample  corre¬ 
lation  coefficient  as  a  ratio  of  a  linear  combi nation  of  a  standardized 
normal  variate  and  a  chi-variate  to  another  chi -variate,  where  all  three 
were  mutually  independent.  This  work  has  been  extended  to  the  multi¬ 
variate  situation  by  Kshirsagar  (1969) ,  who  gives  a  matrix  representation 
of  this  result  for  correlation  between  two  vectors. 

Following  Ruben's  woik  in  the  bivariate  case  and  Kshirsagar's 
work  in  the  multivariate  case,  similar  results  have  been  obtained  in 
the  complex  case.  These  will  be  presented  here. 


Consider  a  random  sanyle 

—  X  —X. 

c Ve  ,  E)  where 


E  of  size  n  from  a 
an 


|yl+  iVl 


y,  +  i-v- 

2  4 


°11  J12 


021  °22 


The  Wishart  matrix  of  sum  of  squares  of  products  is  given  by 


ki  *12 


*21  *22 


■v  CW2(A|n-l|E)  , 


where 


^  («i)  -  V* 


I 


k«=l  ,  ^ 

z  =  -  for  l  =>  1,  2 

i  n 
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The  coherence  between  z ^  and  z is  given  by 


ril°22 


and  the  sample  coherence  is  given  by 


r alla22 


Define 


(1-pp*) 


M1/2 


(1-rr*) 


(2.3.0) 


It'  is  the  quantity  r  whose  distribution  will  be  approximated. 

Now  make  the  transformation  from  E,  to  Y  by  the  following  trans¬ 
formation 


*2*1  “  C(^  " 


where  C  is  given  by 


1/2 

ou(l-pp*)i/ 


022<1-pp*) 
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Prou  Theorem  2.1,  the  distribution  of  Y  is  given  )  y  CN2  (0,  GC'  ■  l2> 
and,  furthermore ,  the  Wishart  matrix  A  has  been  reduced  to  its  canonical 
form,  i.e.,  D  ■  CAC* '  'v  (d|  n— 1|  I^) .  These  results  are  suranarized  in 

the  following  table 


Variable 

K 

Y  -  C(£  -  ji) 

mean 

V 

? 

variance 

l 

I 

Wishart  matrix 

A 

D  =  CAC*' 

The  Wishart  matrix  D  «•  CAC*'  has  the  form 


D  - 


11 

2 

11 


-P  *a 


U 


2  ,,  *,1/2 

°11  1-pp  * 


a*2 


-pa 


11 


12 


2  ,,  *,1/2  ,,  *,1/2 

ail  1-PP  °lia22  1_PP 


where 


5  - 


a  pp  * 
11  M 


*12P 


*12P* 


*22 


2  2 
°11(1'PP#)  °11°22(1_PP*)  CT11°22(1”PP*)  a22 ( 1_pP 


Since  D  ^  cm  {d|  n— 1 1 )  ,  it  may  be  written  as  D  =■  3B*’  where  B  is  a 
P  ^ 

lower  triangular  matrix  given  by  the  Bartlett  decomposition.  BB* '  is 
given  by  (observing  the  b^  for  i  *  1,  2  are  real) 


'bll 

0  " 

bll 

b;r 

Mi 

bnbh 

b21 

b22_ 

0 

b22_ 

mi 

b  b* 

L  21  11 

lb21b2*l  *  b=2>. 

and  equating  the  elements  of  BB* '  with  the  corresponding  elements  of 
D  -  CAC*' ,  the  following  system  of  equations  is  c  tained. 
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(2.3.0)/  the  following  representation  for  r  is  obtained 


b12  *  C  bll 


which  can  be  rewritten  as 


CN  (0,  1)  +  p  -^5=2 
1 


(2.3.2) 


where  xv  denotes  a  real  chi-variable  with  v  degrees  of  freedom.  Attention 
should  be  called  to  the  distribution  of  r  in  'he  null  case,  i.e.,  p  =  C, 


CN(0,  1) 


which  has  the  appearance  of  what  could  be  called  a  complex  t. 

Consider  a  probability  statement  about  the  complex  random  variable 


defined  by  (2.3.2) 


P(r  <  rQ)  *  P 


CN  (0,  1)  +  p 


or  rewritten  as 


where 


P (L  <-  0) 


. .  -  X2n-2  -  X2n-4 

L  =  CN  (0 ,  1)  +  p - a  - 

/2  °  /2 


(2.3.3) 
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Now  following  Kshirsagar's  (1969)  work,  a  matrix  representation  for 
coherence  between  two  complex  vectors  will  be  derived.  Let  the  variance- 
covariance  of  two  complex  vectors  X  ^qxl  ***  9^v®n  ky 


E 

E 

11 

12 

E 

E 

21 

22 

(2.3.4) 


(p+q)  x  (p+q) 


and  let  the  matrix  of  corrected  sum  of  squares  (s.s.)  and  sum  of  products 
(s.p.)  of  observations  from  a  sample  of  size  n+1  on  these  variables  be 
given  by 


s 

s 

11 

12 

S21 

S22 

P 

q 

(2.3.5) 


(p+q)  x  (p+q) 


which  is  based  an  n  degrees  of  freedom.  Then  the  following  matrices  can 
be  obtained: 


Bq  «  matrix  of  regression  coefficients  s12Sll'  °*  £  00  ?  ? 

B  ■  matrix  of  s.s.  and  s.p.,  due  to  regression  si2S22S21  * 

A  -  "residual"  s.s.  and  s.p.  matrix  -  s12s22s21  *  sn.2  * 

A+B  «  "total"  matrix  . 

The  corresponding  matrices  for  the  population  are  (5  ■  E 12^22  '  "12^22^21  ' 
and  f11<2  -  En  -  E12E22E21  '  *e*P«ctivBly. 
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If  the  vector  I(pfq)xi  defined  by  Y'  ■  [X' | Z' ]  is  distributed  as 
C**(p*q)^#  where  V '  ■  lyxl V«zl  and  E  is  defined  by  (2.3.4),  then  the 
natrix  S,  defined  by  (2.3.5)  is  distributed  as  C*(p+q) (s| n| £ )  and  froei 
this  density  it  has  been  shown  by  Kabe  (1966)  that 


3,,  ^  *  or  (s,,  Jn-q|E,,  J 

11*2  p\  11*2'  11*2/ 


(2.3.6) 


and  that  S^t2  i®  distributed  independently  of  Bq  and  s22  .  Also, 
Kabe  (1966)  has  shown  that 


S 


22 


and  that  ,  for  fixed  S22 ,  is  distributed  as 


CN  (B,  £ 
P  ~ 


11*  2 


) 


(2.3.7) 


(2.3.8) 


and  is  independent  of  2  . 

Now  let  *,  n,  M,  F,  C,  and  K  be  lower  triangular  matrices  such 


that 


“22 

‘11*2 

S22 

A 

B 

A+B 


_  « 

-  nn** 

-  MM*' 

-  FF*' 

-  KX*' 

-  CC*  ' 


Define  further 
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0  -  iT^Bq  -  B)M  (2.3.9) 

V  -  *“1M 
w  -  n”1? 

P  "  r-1^**-1 

R  - 

R  -  C~Sc 
L  -  RR*'  ■= 

Transform  to  0,  V,  and  W  frost  B^,  Sll*2'  resPecti-vely  f  id  (2*3*6) f 

(2*3.7),  and  (2.3*8).  Then  it  can  be  shown 
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regression  sum  of  squares  to  the  tota1  sum  of  squar-s  and  RR* '  is  the 

2  2 

matrix  generalization  of  r  /1-r  ,  the  ratio  of  the  regression  sum  of 

squares  to  the  residual  sum  of  squares.  Observing  that 


u  +  pv  - 


where  J  is  the  population  matrix  corresponding  to  R,  then 


R  -  W-1 (U  +  PV) 


(2.3.10) 


which  is  the  matrix  analogue  to  the  result  (2.3.2),  i.e., 


oyo,  1)  +  p 


J?. 


X 


2n-4 

/2 


which  corresponds  to  Ruben's  result  in  the  real  case.  Then 


£  ■  U  +  PV  -  WRq 


(2.3.  ") 


is  the  matrix  analogue  of  result  (2.3.3),  and  also  corresponds  to 
Kshirsagar's  result  in  the  real  case. 

Prom  this  result  (2 . ^ . 11) ,  one  can  obtain  results  similar  to  Ruben's 
about  every  element  of  the  matrix  however  in  multivariate  analysis ,  one 
praters  an  overall  criterion  based  on  the  whole  matrix  rather  than  indi¬ 
vidual  results  on  the  elements.  Hence,  no  attempt  is  t  ie  to  pursue  this 
further;  instead,  we  shall  be  considering  overall  criteria  such  as  Wilks' 


A  in  the  complex  case. 
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4.  Multiple  Coherence 

In  the  case  of  real  multivariate  normal  analysis,  an  extensive 
amount  of  work  has  been  done  relating  to  the  multiple  correlation 
coefficient,  i.e.,  the  maximum  correlation  between  a  random  variable 
x  and  a  linear  combination  of  a  set  of  variables  x, ,  x„,  •••  ,  x  . 

p  12  p-X 

If  ,  where 


y  =  ! 


°11  °12  "• 

•  «  • 

«  •  • 

•  •  • 

aip 

• 

• 

- 

r 

£u 

r 

“  12 

rr  G  •  •  • 

a 

a 

pi  p2 

pp 

21 

PP 

denotes  the  variance -covariance  structure  of  this  set  of  variables 

x, ,  y.  ,  ♦  *  •  ,  x  ,  it  can  be  shown  that  the  square  of  the  maximum  cor- 
1  2  p 

relation  coefficient  is  given  by 


xp<V  •••  ,  xp_1) 


y  y"*V 
2111  12 

a 

PP 


and  the  same  multiple  correlation  is  given  by 


VV  '  Jtp-i) 


A21A11A12 


PP 


where 


A  = 


A 

A,  „ 

11 

12 

A_  , 

a 

21 

PP 
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:.s  the  sample  variance-covariance  matrix  based  on  a  sample  of  size  n. 

2  2 

Hodgson  (1967) ,  following  Ruben's  (1966)  approach,  has  shown  R  /(1-R  ) 
is  distributed  as 


2 

Xn-k-l 


where  the  x  and  the  unit  normal  variate  U  are  mutually  independent. 

Prom  this,  Hodgson  has  obtained  a  normal  approximation  for  the  statirtic 

2  2 

P.  /(1-R  ).  These  results  have  been  achieved  in  a  somewhat  similar 
fashion  for  the  multiple  coherence  in  a  multivariate  complex  normal 
situation. 


Suppose  t,  'll  Oi1  (it,  T.) .  Without  loss  of  generality,  it  will  be 
assumed  that  m  -  0  and  I  has  the  following  special  form 


where  R  is  the  true  multiple  coherence  between  5  and  £^,  ***  #  5  ^  » 

is  defined  in  (1.2.6).  This  can  be  shown  as  below: 

Let  £  'u  CN  (y,  E),  where 
P  " 


and  E 


La 


o  ,  o  , 

pl  p2 
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E_ ,  o 

pp 

21  1  pp 

Observing  that  an  hemitian  positive  definite  matrix,  then  there 

exists  a  D  such  that  E. .  *  DD*'  and  D  T„  j>* '  “  =  i  .  Make  the  following 


DD*'  and  D_1EjjD* ,_1  =  i 


li  ”  -  1 r  >-1 

transformation  from  Z  -*■  Y  by  Y  =  A(£  -  y) ,  '..here  A  has  the  following 

form 


then  E (Y  )  *  (?  and  var(Y)  =  AEA*’,  observe 
-pxl 


AEA*’ 


Furthermore,  the  multiple  coherence  remains  invariant  under  this  trans¬ 
formation,  since 


y  '  r-lr 
12  11  12 
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CnD*— lD-XE12  JaI^u 


W  V  '  Yp-1) 


i*6*i 


?ep«1.  •••  »  ^p.x)  “  PVp(Vr  V2,  •••  ,  Yp  l)  • 


Thus  we  have 


where 


Y  .  ^  CM  (j»,  e) 
-px  1  p  - 


E. 

IP 

Vi 

• 

• 

m 

Ep-lp 

Epl ep-lp 

1 

Ocnsider  the  true  regression  of  Yp  on  Y^/  Y^#  •"  •  Yp_i'  can  ^ 

written  as 


SdplYj.  «2.  —  .  Yp_j)  -  6^  ♦  B2*2  +  "•  +  VlVl  ’ 

where  8'  -  e*  I  -  [e  e  •••  ,  e  ]-  Make  the  following 

p-1  pi  p2  p-i  p 

trans  f omation 
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«2 

x,  -  — . .  y+ 


a  .  a  . 

-p-l-p-l 


x  °  y 

p  p 


where  a_,  a,,  •••  ,  a  ,  are  so  chosen  that 
-2  -3  -p-1 


-  5j£  - 


5p-l£  *  0  • 


i.e.,  find  p-2  vector  orthogonal  to  e  ,  and  noting  the  following  relation¬ 
ships  among  the  x's  . 


vartx^  =  1  for  i  «=  1,  2,  •••  ,  p 


cov(x  ,  x.)  =  a*a.  »  0  for  i  f  j ;  i  =  1,  2 ,  • • •  ,  p-1 
13  1  3  j  <=  1,  2,  •••  ,  p-1 

covfx^,  xj  ■=  f^'a^  =  0  for  i  =  2,  •••  ,  p-1 


-xjv(x  ,  x.)  ■  BJ'a.  ■  0  for  i  *=  2,  *•*  ,  p-1 
pi  31 


co v(x, ,  x  )  =  cov 
1  P 


\  p  /  /b*7} 


p 


Thus  we  have 


C»p(0,  $) 


x  * 
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where 


(2.4.0) 


Let  ***  /  £  denote  a  random  sample  of  size  n  from  a 

-1-2  -n 

0^(0,  E)  where  E  is  of  the  form  given  by  (2.4.0).  Let  A^  denote  the 
complex  Wish art  matrix  associated  with  this  sample,  i.e.. 


n 

K  =  l  S.  ??'  *  CW  (A[n|  E)  . 

^  i=l  p 

It  is  desired  to  look  at  the  Bartlett  decomposition  of  this  Wish art. 
matrix,  but  the  distribution  of  is  not  in  the  canonical  form.  By 
making  a  transformation  of  the  original  sample,  it  is  possible  to 
express  A^  as 

Aj.  =  CTT* '  C*  ' 

where  T  is  a  p  x  p  triangular  matrix  whose  elements  are  independent  random 
variables,  the  off  diagonal  elements  being  0^(0,  1)  and  the  diagonal 
elements  being  x  variables.  To  see  this,  consider  the  following  argument. 

Let  £  ^  CNp(0,  E),  E  is  hermitian  positive  definite,  thus 
E  -  CC*'  .  Transform  from  A  to  A  by 

^  C-lA^C*'-1  , 
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where 


or 


Z  m  CC*' 


A?  <=»  CA^C*'  »  CTT*'C*' 


(2.4.1) 


which  is  the  desired  result. 

Now  if  5,^  *»*  0*^(2*  s)  30(1  s  ^  of  the  form  (2.4.0) ,  the  matrix  C, 
such  that  C“l£C*’"i  ■  Ip  is  given  by 


1  0 
0  1 


0 

0 


0 

0 


e  • 


•  • 


•  •  •  • 


o  o 

P  0 


1  0 
o  a-P2)1''2- 


and  from  (2.4.1)  ,  A  is  given  by  A  «=  CTT*'C*'  .  First  examine  CT, 


CT 


Vi 

0 

_  P  0  •••  0 

(l_p2)l/2 

S'T  +  (l-P2)1/2^’  (1-P2) 1/2tpp 


observing 


S't  +  u-p2)2t'  -  lptn#  0  0  •••  O]  +  [o 


2  2  2  7 

V  ’  v“'  °-p  >•*, 


and  letting  X1  represent  this  quantity,  thm 


CT  = 

T 

0 

X’ 

<I-P2>1/2t 

pp 

and  CET*,C*'  is  given  as 

r 

CTT*'c** 


fJWf'*  * 
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.J 

n  >* v 
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1 

X’T*  ’ 

d-p";  t" 

- 
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<•  x'x* 
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observing  that 


X'X*  =  [Ptu  +  (l-P2)17^]  [Pt„  +  (1-.  2)1/2t*1] 


pi' 1  11 


Pi 


Hence 


2  P;X 

(1-P  )  l  t  4t*.  . 

1=2  pipl 


A.  . 

A 

11 

12 

A- , 

a 

21 

PP 

TT*' 


■rx* 
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x’t*  ’  ’  n-pv-d 


(2.4.2) 


Interest  lies  with  the  quantity  R2/(l-R2).  Obs>.  rve  that 
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1-R2  - 


| A__ I  *  a 

11  pp 


(2.4.3) 


which  may  be  rewritten 


1-R2- 


2  P  2 

(l-p  )  n  t:. 

_ i-1  11 _ 

•  {(i-p2)t2  +  x'x*} 

pp  — 


2  2 
(l-P)t. 


{(1-P2)t2  +  X'X*} 


(1-P2)t2 

PP 


.2, 1/2.  ti2  .  „2%  Pf1 


ItPt..  +  (1-P  )  t  }|  +  (1-P  )  l  t.t* 


px  px 


2  2 

U-pV 

PP 


i*Pi -  (p/(i-p2)i/2)tui2 ♦  T  tplt*. 


(2.4.4) 


2  1  /2  2 

where  |t  ^  +  (P/(l-P  )  )t^|  represents  the  modulus  of  the  quantity. 


Let  6 


'(1-P2) 1/2 


— -  t  ,  +  t,.  I  and  usinq  the  fact  that  t  ,  ^  CN  (0,  1) 

P  pi  11/  pi  1 


and  t^  which  is  a  chi-variable  and  can  be  approximated  by  a  real 


can  be  approximated  by 
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CHAPTER  III 


1.  A  Teat  for  Equality  of  Means 

In  the  case  of  k  real  p-variate  normal  populations,  tests  have  been 
developed  for  testing  hypotheses  related  to  the  means  of  these  k  popu¬ 
lations.  Test  statistics  have  been  developed  by  Roy,  Law ley.  Hotelling, 
Pillai,  and  Wilks.  Wilks'  statistic,  usually  referred  to  as  Wilks'  A, 
is  the  simplest  one  and  is  related  to  the  likelihood  ratio  criterion. 

Thus  it  has  the  desirable  properties  associated  with  the  likelihood  ratio 
procedure.  It  is  for  these  reasons  that  the  complex  analogue  of  Wilks' 
statistic  will  be  used  to  develop  tests  for  distinguishing  between  the 
neans  of  k  conplex  p~variate  normal  populations. 

Let  samples  of  size  n^,  n2,  ***  ,  n^  be  available  from  k  ■  q+1 
complex  p-variate  normal  populations  with  means  y(l),  V(2) ,  ••*  ,  y(k) 
and  the  same  variance  covariance  structure  given  by  E.  Let  denote 

the  vector  of  responses  for  the  member  of  the  a  population  (l  m  1, 

2,  •••  ,n  ;  a  =■  1,  2,  •••  ,  k)  on  which  the  observations  are  made.  It 
a 

is  desired  to  develop  a  test  for  the  hypothesis  of  equal  means,  i.e., 

H0:  =  li(2)  =  *  )±00 

against  the  alternate  hypothesis  that  the  populations  have  different 
means.  Represent  the  alternate  hypothesis  by  Ha. 

Giri  (1965)  has  shown  that  the  likelihood  ratio  criterion  for  certain 
hypotheses  about  conplex  multivariate  normal  populations  possesses  optimum 
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properties  which  are  characteristic  of  this  test  in  the  real  case.  These 
properties  will  not  be  investigated  here,  but  since  the  likelihood  ratio 
test  has  been  found  satisfactory  for  the  problem  in  the  real  case,  it  will 
be  used  here  in  the  complex  case.  The  author  feels  that  the  desirable 
properties  are  maintained. 

Letting  N=n^+n2+***+n^,  then  the  likelihood  function,  Q, 
may  be  written  as 


Q  =  — np  '  rrir  expr  1  1 

(!!)  (  £  )  (  a=l  0  -1 


k  na 


-  i  r  [sai  -  ual  -  • 

0=1  p  1 


The  likelihood  ratio  statistic  is  given  by 


Max  Qr 


Max  Q 


where  Max  is  the  maximum  of  the  likelihood  function  under  the  assump¬ 
tion  that  the  null  hypothesis,  H0,  is  true  and  Max  Q  is  the  maximum  value 
of  the  likelihood  function  over  the  entire  parameter  space,  denoted  by  ft. 
The  test  is  given  by  rejecting  HQ  when  cA  is  less  than  some  specified  con¬ 
stant  depending  on  the  size  of  the  test. 

Using  Giri's  (1965)  results,  the  Max  is  obtained  when 


X  na 

=  i  Y  V  ? 

N  -at 


o=l  t=l 


k  nr 


-  T  where  T  =  l  l  (£ 


a=l  £=1 
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The  Max  Q  i3  given  when 


n 


U  „  _L  V  f  =  r 

n  .*■-  — afc  -a* 

a  £=1 


and 


,  n 

k  a 

£  =  A  =  J  A  where  A  -  7  (£  ,  -  £  >(£.,•£  >*' 

a  a  -a£  -a*  -an  -a* 

(X=l  0.01 


After  some  simplification,  it  can  be  shown  that 


A  ■ 


|A| 


T 


and  observing  that 


T  <=  A  B 


where  B  is  defined  by 

k 

B  -  I  n  (t  -  5  )  (5  -£)*’. 

*■,  a  »a*  2ar 

a<=l 

Then  the  likelihood  ratio  statistic  is  given  by 

M 

A  - - 

C  IA+Bj 

The  statistic  is  the  complex  analogue  of  the  result  obtained  by  Wilks; 
it  will  be  referred  to  as  Wilks'  A  in  the  complex  case  and  denoted  as  cA  . 

Gup- a  (1971)  has  derived  the  exact  distribution  of  when  the 
alternate  hypothesis  is  of  unit  rank.  He  gives  the  explicit  expression 


for  the  case  of  p  =  2,  3  and  general  degrees  of  freedom  for  the  hypothesis 


and  error. 

The  results  obtained  by  application  of  the  likelihood  ratio  principle 
may  be  summarized  in  the  following  coup  lex  multivariate  analysis  of  variance 
table '  (CMANOVA) . 


Source  of  Variation 

mm 

s.s.  and  s.p. 

Betv-een  ...  'or..'.;; 

q 

k 

B  —  l  n  (5  -K  )  (£,  -g  )*' 
c,= 1 

A  =  t 

Within  groups 

n-q 

TOT.u 

n  =  N-l 

k  "a 

a=l  £=1 

+ (by  subtraction) . 


That  the  degrees  of  freedom  are  indeed  q  for  the  between  groups  and  n-q 
for  the  within  groups  will  now  be  shown  by  examining  the  distributions  of 
the  matrices  B  and  A. 

Define  a  p  x  n  matrix  X  by 


X  = 


f-Il'  -12' 


Let  denote  an  N  »  B  matrix  composed  of  all  ones  and  let  e„„,  denote 
NN  ~NX 1 

a  vector  with  all  elements  being  unity.  Also  define  e^,  an  N  x  1  vector, 
with  zero  everywhere  except,  in  the  nQ  positions;  there  are  ones  in  all  of 
the  n  positions,  i.e., 

e;  .  10  0  •••  0  •••  0  o  •••  0llyH  . 

n  ones 
ct 
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Then  tin?  matrices  A,  B,  and  A+B  may  be  written  as 


''"Xk-  l  ,f  Sad**’  •XDlt*'  ! 

L  a=l  a  J 

A+B  =  X  L  -  z:  E%_,  X*'  =  XEX* '  . 

L  N  N  NNJ 


X*’  =  XCX*' 


(3.1.0) 

(3.1.1) 

(3.1.2) 


It  should  be  noted  that  the  matrices  C,  D,  and  E  are  idempotent  of  ranks 
q,  n-q,  and  n,  respectively. 

Observing  that 

E (X)  =  M  =  [y  (1)  ,  y  (1)  /  **  *  /  V  (1)  |  *  *  * !  JJ  (k)  ,  y  (k) ,  •  •  •  ,  y  (k)  ]  (3.1.3) 
and  recalling  that 

then  by  Theorem  2.4, 

(X-M)C(X-M)*'  -v  CWp((X-M)C(X-M)*’|qjl)  .  (3.1.4) 

Thus  the  degrees  of  freedom  associated  with  the  between  groups  sum  of 
squares  (s.s.)  and  sum  of  products  (s.p.)  matrix  B,  is  given  by  the  rank 
of  the  idempotent  matrix  C  which  is  q. 

From  (3.1.3),  it  can  be  shown  that 

E (XCX* ' )  =  qZ  +  MCM*' 

and  under  the  assumption  of  HQ,  i.e.. 


equal  means  for  the  k- comp lex 
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p-variate  populations,  M  can  be  written  as 


M  ”  BSlxM 

where  jj  represents  the  common  value.  Observe  that  under  H0, 

MC  «  (7  . 


Then 


(X-M) C (X-M)*'  *=  (X-M)  C  (X* 1  -M* ' ) 

e  XCX*'  -  XCM*'  -  MCX* '  +  MCM*' 
<=  XCX* '  . 


Thus 

XCX*'  a,  cwp(xcx*'|q|j:)  if  yd)  -  y(2)  =  •••  o  y(k)  . 

New  the  distribution  of  A  ■  XDX*'  needs  to  be  established.  From 
Theorem  2.4,  it  is  seen  that 

(X-M)D(X-M)*'  -v  CWp(CX-H)C(X-M)*' |n-q|E)  . 

And  the  degrees  cf  freedom  associated  with  A  are  given  by  n-q,  the  rank 
of  the  idempotent  matrix  D. 

New 


(X-M) D (X-M) *'  *  XDX*’  -  XDM* '  -  MDX* '  +  MDM* 1 

and  it  can  be  shewn  that 


MD  -  p  . 
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Hence 


(X-M)a(X-M)*'  -  XDX* '  <v  acp(xDX*'|n-q|E)  . 

Attention  should  be  called  to  the  fact  that  the  distribution  of  A  -  XDX* ' 
is  independent  of  the  null  hypothesis,  i.e. ,  regardless  of  whether  H0  is 
true  or  not,  the  distribution  of  A  is  still  CNp(A|n-q|E) . 

Also,  it  should  be  noted  that  A  and  B  are  distributed  independently. 
Since,  as  in  the  real  case,  two  forms  XCX* '  and  XDX*'  are  independent  if 
CD  -  g  . 

To  summarize  these  results,  it  is  seen  that 

B  *  CWp(B|q|E)  if  p(l)  -  p  (2)  ®  •••  -  p  (k) 

A  ^  CWp(A|n-q|E)  independent  of  H0 

and  A  is  distributed  independently  of  B.  The  test  statistic,  A,  is  given 

c 

by  the  ratio  of  the  determinates  of  the  complex  Wishart  matrices  A  and 
A+B ,  i.e., 

a| 

A  = 
c 

Khatri  (1965)  has  shown  that  the  distribution  of  A,  where  A  is 

c  c 

defined  as  above,  is  the  same  as  the  product  of  p  independent  real  beta 
variables,  i.e.. 


I  aJ  p 

A  a  -  «  n  U 

c  I  A+B I  i=l 


where  ^  B(n-q  -  i+1,  q)  and  all  are  independent  for  i  «■  1,  2,  •••  ,  p. 
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Furthermore ,  he  has  shown 


-m  log  A 
3  c 


2 

X2pg 


where  m  =  (2n-q-p)  and  "s>"  denotes  "approximately  distributed." 

Hence  if  H0  is  true,  -m  log  A  will  be  distributed  as  an  approximate 

c 

chi-square  variable  with  degrees  of  freedom  given  by  2pq. 

To  summarize  the  procedure  for  testing  the  hypothesis  of  equal  means 
among  k  =  q+1  complex  p-variate  normal  populations,  one  must  first  cal¬ 
culate  the  matrices  A  and  A+B.  Then,  consider  the  statistic  A  given  by 

c 

M 

A  =  - 

°  I A+B j 

2 

and  coiipute  -m  log  ^A  .  If  -m  log  ^  is  greater  than  x2pg  at  the  desired 
a-level,  then  reject  HQ,  otherwise  do  not  reject  H0  . 


2.  A  Test  for  the  Dimensionality  of  the  Mean  Space 
In  the  last  section  a  test  for  equality  of  the  means  from  k  p-variate 
complex  normal  populations  with  the  same  variance  covariance  structure  was 
developed.  The  hypothesis,  H0,  implies  that  the  mean  vectors  of  the  k 
populations  would  lie  in  a  complex  space  of  zero  dimension,  i.e.,  repre¬ 
sented  by  a  point  in  the  complex  space.  If  H0  is  rejected  in  favor  of  Ha, 
then  the  means  of  the  k  populations  may  lie  in  a  complex  1  dimensional 
space,  that  is  they  are  collinear,  or  in  a  complex  2  dimensional  space, 
that  is  they  are  coplanar,  and  so  forth  up  to  a  complex  k  dimensional 
space. 

In  real  multivariate  analysis,  it  is  important  to  know  the  dimen¬ 
sionality  of  the  space  spanned  by  the  mean  vectors  for  this  is  equivalent 
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NOT  REPRODUCIBLE 

to  knowing  the  r.uaiV  r  o'-  :.i.  -.nr  uincrir: ir  "~t  ftincr.?.Ci~5  that  arc  nacesc  •  y 

to  distinguish  among  the  k -populations.  This  is  true  also  in  the  complex 
normal  situation.  In  this  section  a  test  for  the  dimensionality  of  the 
complex  space  spanned  by  the  mean  vectors  will  be  developed  and  will  be 
shown  to  be  equivalent  to  the  number  of  linear  discriminant  functions 
needed  to  discriminate  among  the  k  populations. 

Recalling  the  complex  multivariate  analysis  of  variance  table  that 
was  developed  in  the  previous  section,  and  adding  a  column  for  the  expec¬ 
tation  of  the  s.s.  and  s.p.  matrices,  then  the  table  is  given  by 


Source  of  Variation 

d.f. 

s.s.  and  s.p. 

E(s.s.  arid  s.p.) 

Between  groups 

q 

B  *=  XCX*' 

E  (B)  «  q£  +  MCM* ' 

Within  groups 

n-q 

A  =  XDX* '  ; 

E  (A)  =  (n-q)E 

TOTAL 

n 

A+B 

Now  M  was  defined  by  (3,1,4)  as 

E  (X)  =  M  =  [jj(1)  •••  jj  (1)  |  •  •  •  |  M  (k)  •••  E(k)]pxN  • 

Denote  the  rank  of  M  by  r(M)  and  observe  that 

r  (M)  =  r([p  (1)  ,  y(25,  ,  p  Oc)  ] )  -  r  (p) 

which  is  equivalent  to  the  number  of  linearly  independent  points  repre¬ 
senting  the  means  of  th?  k  populations .  Denote  the  k  populations  by 

it  ,  -n  ,  *  •  •  ,  tt  and  consider  a  linear  combination  of  §,  an  observation 
12  k 

from  one  of  these  populations,  given  by  £*'£  •  Suppose  further  that 
H*'w(l)  -  £*'u(2)  *=  •••  «  '  p  (k)  ■=■  6  , 
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i.e.,  that  l*'E,  has  the  same  mean  in  tr^,  ***  ,  ir  .  Then  this  linear 
function  given  by  H*'  would  not  be  useful  in  discriminating  among  the 
populations,  i.e.,  one  could  not  determine  which  population  £  came  from 
by  examing  Jl*'£  .  Furthermore,  observe  that 


i*’M 


[6  6  •  •  *  6]  =  6  •  e 


1*N 


then 


£*■*=«  el.»  c' 


;  6  e 


(l  Iee'  -  IE  ^ 

1*N|  L ,  n  -a-ct  N  NN I 
\a=l  a  / 


=  {  O'  =  O' 


and  hence 


2.*‘MCM'*,8.  -  0  . 


Conversely,  if  =  0,  this  implies 


k 

T  n  (u  -  u) (u  -  u)*'  =  0  (3.2.0) 

L .  a  a  a 


k  n  £*'y  (a) 

where  £*'y(a)  =  u  and  u  =  J  - - - - .  But  (3.2.0)  implies  that 

-  u  ,  Ln 

a=l  a 

each  term  of  the  summation  must  be  zero  or  that 


u  =  u  for  all  a  =  *,  2,  •••  ,k. 


From  this  it  is  seen  that 


£*  ’  p  (1)  =  £*'y<2)  =  =  £*'y(k) 


51 


So  if  £*'?  has  the  same  maan  value  in  '*•  /  ^  then 

£*'MC  »  0  and  conversely  if  £*'MC  =»  o,  then  £*'5  will  have  the  same 
mean  value  in  ir^,  ir^,  •••  #  ir^  •  In  either  case  £*'£  will  not  be  use¬ 
ful  as  a  discriminator. 

Thus  a  linear  function  of  §,  given  by  £*'£,  will  not  be  a  good 
discriminator  if 

£*'MC  =  0  .  (3.2.1) 

An  adequate  number  of  linear  discriminators  will  be  given  by  p  minus  the 
number  of  independent  £'s  which  satisfy  (3.2.1).  For  any  £,  such  that 
(3.2.1)  is  satisfied,  £*'£  will  be  called  a  covariate.  Observe  that 
(3.2.1)  is  equi valent  to 

MCM*'£  =  C  . 

So  the  maximum  number  of  such  covariates  is  equal  to  the  multiplicity  of 
the  zero  characteristic  roots  of  the  matrix  MCM*'  .  Denote  this  multi¬ 
plicity  by  s.  Now  the  rank  of  MCM*'  is  equivalent  to  the  number  of  non¬ 
zero  characteristic  roots  of  MCM*',  say  r,  which  is  the  number  of  linear 
discriminate  functions  needed.  Hence,  we  have  the  number  of  discriminate 
functions  needed  given  by 


r  -  p-s 

Let  the  s  covariatts  be  given  by 

£*'  UK,  £*■(?>£.  **•  ,  (sK 
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and  let  the  matrix  h  be  defined  by 


L 


'  **'(1) 

£*' (2) 


£*'<s) 


sxp 


(3.2.2) 


then 


L  p  . 
sxp  pxk 


vE 


Ik 


=  fv,  v, 


,  v] 


since  £*'(t)£  (t  =  1,  2,  •••  ,  s)  is  a  covariate  and  this  implies  that 
£*'(t)u(a)  =  v,  (t  =  1,  2,  ,  s;  a  =  1,  2,  ,  k) .  Since  Ml), 

£  (2) ,  •••  ,  £ (s)  are  independent,  this  implies  that  the  rank  of  L  is 
s,  i.e.,  r(L)  =  s  .  Thus  the  hypothesis  of  needing  r  discriminate  functions 
is  equivalent  to  the  hypothesis  that  there  exists  a  matrix  L  of  rank  s 
such  that 


Lu  “  vE 


lk 


for  some  unspecified  v  .  A  test  for  this  hypothesis  will  now  be  developed 
by  using  the  likelihood  ratio  criterion. 

Recall  that  the  likelihood  function,  Q,  was  given  as 
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By  adding  and  subtracting  £ (a } 
written  as 


n 

1  a 

~  K  ?  and  simplifying,  Q  may  be 

«  2.^1  _f'  ' 


Assuming  E  to  be  known,  the  Q  can  be  written  as 


Q  «  ^constant  w.r.t.  unknown  parameters,  u(u)) 


•  exp 


/  k  > 

(-  l  ra(u  a)-vi(a))*'E-l(ua)-u(a)) 

\  a=1  ) 


The  parameter  space  ft  consists  of  the  2pk  parameters  given  by  u(l), 

W  (2) ,  •••  ,  y(k)  .  The  maximum  of  Q  with  respect  to  ft  is  given  when 
y(a)  -  |(a)  . 

Now  the  likelihood  function  must  be  maximized  subject  to  the  con¬ 
ditions  of  the  hypothesis  of  needing  r  discriminant  functions.  This  is 
equivalent  to  the  hypothesis  that  there  exists  a  matrix  L  such  that 


H:  l4J(a)=v  a  ■=  1 ,  2,  •  •  •  ,  k 


or 


H: 


Lu  «=  vE 

-  Ik 


which  is  equivalent  to  saying  the  r (MCM* ' )  =  r  .  In  all  cases  it  should 
be  noted  that  the  matrix  L  and  vector  v  are  not  specified.  The  maximi¬ 
zation  under  the  assumption  that  H  i6  true  will  b^  -arried  out  through 


the  following  procedure. 
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Make  the  following  transformation 

LC(a)  =  ^(a)  a  =  1,  2,  ,  k 

Ll§(a)  =  a  =  1,  2,  •••  ,  k 

choosing  such  that 

lsl*  1  =  p  . 

2 

Letting  R  denote  the  exponent  of  Q,  then 

-  k 

R  -  j>  n  (1(a)  -  u  (a))  * '  I  (?{a)  -  ji(a>) 
a=l  a 

k  , 

-l  n  (L?(a)  -  Lu  (a)) * 1  (Li L* ' )  (l£(a)  -  Lp(a)) 

a=l  a 

k  ,  _ 

+  l  na(L1|(o)  -  L1u(a))*’  (L^L*1)  (l^a)  -  L^fa)) 


and  under  the  assumption  that  the  hypothesis  is  true,  the  above  becomes 

k  1 

R  =  l  n  (y_(o)  -  v)*'  (LLL*')  (y(«)  -  v) 

a-1  a 

k  , 

V  l  n  (z(a)  -  L^y  (a))  * '  (L^£L*' )  (z(a)  -  L^to)) 

a=l  “  1 

2 

and  to  maximize  the  likelihood,  the  exponent  R  is  minimized  (observing 
2 

that  R  is  real)  .  Hence 


Min  R 


Min 


k 

y 

=i 


(v(a)  -  v)*1  (L-lL*  1 )  ~  (v  (a)  -  v) 


where  B  is  the  between  groups  s.s.  and  s.p.  matrix  from  the  CMANOVA  table. 


:  /■> 

now  i***  -  -  tt  * 1  ;  ♦’ 


Min  P2  =  Min  tr(TT«’)(;  ~ 1 '  'til  ~l /2T* • ) 


-1/2  -1/? 

=  Min  tr  Y(X  P,r  '  )y*’ 


- 1  -'2 

where  Y  =  (TT*1)  '  ",  observing  YY* ’  -  J.  So  the  problem  of  minimizing 

2  -i/2  -1/2 

R  with  respe'-r,  to  n  reduces  to  minimizing  the  tr f Y (Y  bE  ')Y*'] 

subject  to  the  ccndi  tion  that  7Y* '  in  the  case  of  real  positive 

definite  symr.et  r*  o  natri  cos  ,  the  niri  m”-i  '  r  oiv  n  b'r  the  sum  of  tne 

-1/2 

smallest  s  roots  of  E  BE  '  [Kao(19'.'5)  ] ,  i.e.,  the  s  small*'',  roots 


of 


-1/2p-.-]/2  -  \.i! 


i,  ?, 


or 


|b  -  x  r;  o  i  -  l,  2,  •  •*  ,  s  . 


Thus  the  maximum  value  of  Q  under  H  is  given  as 


-1/2  -1/2 

(constant)  *  exp(-  sum  of  the  smallest  s  roots  of  l  BE  ) 


and  the  likelihood  ratio  statistic  X  is  given  as 


-1/2  -1/2 

(sum  of  the  sitallest  s  roots  of  BY  ) 


- 2  In  A  =  (sum  of  s  smallest  roots  of  Y.  Y. 


2  ( A  +  >  +  •  •  • 

r t  1  r t2 


’>  1 
P 


and 


'at?:  I 


Part  IT 


•tap  Zt  Mia  of  fart  II,  l.«.f  ala  of  Part  XX  with  rasps ct  to 


(a) .  The  natar  of  parameters  restricted  was  2  (p-s)k 
or  2rs. 

Step  II:  Min  of  Part  I,  i.e.,  min  of  Part  I  with  respect  to  v  . 

The  number  of  parameters  restricted  was  2s. 

This  left 

Min  R2  »  Min  tr (LEL* 1 ) _1 (LBL* * ) 

(w. r.t.L) 

Now  L  is  an  s  *  p  matrix  of  rank  s  and  the  number  of  parameters  restricted 
needs  to  be  determined.  Write 

L  «  a[H  |  G  ] 
s  p-s 

and  note  that 

H_1L  «  [I  |  H_1G]  . 

0 

Returning  to 

Min  tr (LEL* ' ) (LBL* 1 ) 

which  can  be  written  as 

Min  tr[L*'  (LEL*')‘1(IJB)  ] 


or  as 


99 


where  1  :•  c'j 


thus 


*.P»  * 


'•  .  <»r.  matrix. 

V 


Choose  T  to  be  H 


[it. 


Jo 


i 


where 


hence  the  min'  .run  in,roJvr»3  only  fmcfci  ons  of  ,  whore  L  has  dimension- 
ality  given  by  :•  '  ( f.< -t,  1  .  Thus  ot.lv  -  >  (p-s) ]  parameters  are  restricted 

in  the  min imi ?. at  i ■  :  *’  •  ( I..-  '  ‘  .  Therefor**  the  number  of 

parameters  rorr.ri  .?  'in-dt-r  ;*  is  ,-i  vnr.  ar. 


.  r/. 


and  the  degrees  c  f  rro-vnn  or.  the  • -square  test  is  given  by 
*1.  f.  =  2pk  -  1 2  rk  +  2s  *  2s(p-s)] 

:.'-r)  q-r- 


where  q  =  k+1  . 

In  summary,  the  teat  of  the  hypothesis  that  the  number  of  linear 
discriminant  funot  i-'ns  :  r.  ct  the  number  of  covariates  is  s  -  p-r,  is 
given  by  the  ' :  -.e  1  !'■  i  r  -*■  ■  -  *  t  ■  i  c  as 

"  '  r*  1  r  *  2  p  -2(p-r)(q-r) 

where  \  ,  ■  i r<~  the  s  smallest  roots  of 

r+ 1  !  • : 


where  E  is  th-‘  -  s .  s .  and  s.p.  matrix  obtained  from  the 


CMANOVA  tai  1  . 


If  E  1*  MMiaflW,  Umb  tW  eitlMtti  \Al-q,  <*•!•  A  iO  tfco  Witfcin 


•  4.  M  11.9. 


,  My  be  Mtd  to  obtoia  a 


tost  for  tbo  nvNr  of  discriminant  fractions  bmM.  the  toot  will 


bo  giwra  am  the  •  smallest  roots  of 


B - A  -  IB  -  nAl  -  0  . 

n-q  1  * 


The  test  statistic  is  given  by 


(n-ql  (2)  (Vl  *  V2  +  *  V  '  X2(P-D  (q-r) 


where  n  , ,  n  „ ,  •  n  are  the  s  smallest  roots  of 
r+1  r+2  p 


|  B  -  nAj  =  0^ 

Instead  of  considering  the  roots  of  |b  -  nA^ ■  9,  consider  the 


roots  of 


i.e. , 


B  -  r  (A+B) I  -  0 


B - -  A  -  0 

1-r  1 


2  :  2 

r  A  r 

n  “  - r  or  - -  - — 

,  2  n-q  ,  2 

1-r  1-r 


1  +  "  "  1-r2 


•1 


for  large  n.  Qiiaf  the  nfeetitmtion 


than 


loq^l  ♦  ■*  -  log(l-r2> 


X  •  2 
-  -  -  log(l-r) 
n 


and 


or 


A  +  A  +  •  ■ 
r+l  r+2 


+  A 


_E  _ 


-  log  II  (1-r  ) 
i-r+1 


P  2*2 

-  2n  log  F  (1-r.)  'v  \  .  .  .  . 

i;ir+1  i  2  (p-r)  (q-r) 


where  the  r7  are  the  roots  oi  |  B  -  r(A+B)|  *=  0.  This  gives  a  procedure 
for  estimating  the  dimensionality  of  the  mean  space.  The  procedure  is 
to  tert  sequentially  the  following  hypotheses. 

dimensionality  space  criterion 


P 

2 

* 

2 

H: 

r»0 

0 

1  pt. 

-  2n 

log 

n 

i-i 

(1-rp 

'X, 

X2pq 

p 

,  2. 

* 

2 

H: 

r=»l 

1 

2  ots. (collinear) 

-  2n 

log 

n 

i-2 

(1-rp 

X2  (p-1)  (q-1) 

P 

.  2. 

4 

2 

H: 

r-2 

2 

3  pts. (coplanar) 

-  2n 

log 

n 

i™3 

(1-rp 

X2 (p-2) (q-2) 

If  the  means  are  known,  then  the  number  of  discriminant  functions  can  be 
determined  exactly  by  examining  the  rank  of  U  where  is  defined  as 


'■440  • 


To  mo  this,  consider  L  as  'll  fine  il  by  (3.2.2)  an d  ondar  H,  tfcs 
hypothesis  that  the  mafcer  o'  discriminant  functions  is  r,  then 

Hi  -  VEU.  . 

Observe 

or 


L  y 
sxp  px 


k 


9  ■ 


(3.2.3) 


Now  the  rank  of  I,  is  s  since  Ml),  A  (2),  ,  Ms)  are  linearly  inde¬ 

pendent  and  this  is  the  exact  nur.ber  of  covariates  that  exist.  From 
(3.2.3)  ,  we  have 


L  E  =0 
Sxp  pxp 

where  E  «*  y  ,  y*‘  ,  and  from  (3.2.3)  we  have 
pxp  pxk  kxp 

£.*’  (1) E  -  0 
(2)E  =  0 


(3.2.4) 


**'  (s ) E  -  0 

and  there  are  no  more  non -trivial  solutions  to  the  system  Et  *  0,  thus 
we  have  the  r(E)  =  p-s  which  is  the  same  as  r(y)  •  p-s  -  r  . 

Thus  the  rank  of  y  is  the  same  as  the  number  of  discriminant  functionu 
needed.  Attention  should  be  called  to  the  fact  that  this  is  the  exact 


number  of  discriminate  functions  needed  and  not  an  estimate. 


•s 


t»  ths  first  csctioa,  Wilks1  A  «mi  dofimd  m 


A 

c 


M 


and  it  was  shown  that  the  test  statistic  for  the  hypothesis  of  equal 
means,  i.e.,  y(l)  «  u(2)  -  •**  »  M(k),  was  given  by 

"*  109  cA  *  X2pq  *  (3*2*5) 

The  distribution  and  correction  factor  m  2n  -  p  -  q  were  derived  by 
Khatri  (1965). 

When  the  dimensionality  of  the  mean  space  is  zero,  this  is  the  same 
as  saying  all  of  the  means  are  equal  and  the  test  was  given  as 


r  0*2 

-  2n  log  n  (1-r')  n,  x2  (3.2.6) 

i»l 

where  the  r*  are  obtained  from  |b  -  r  (A+B) |  -  0.  But  the  hypothesis 
that  the  dimensionality  ic  zero  is  the  same  as  the  hypothesis  that 
y(l)  ■»  jj (2)  w(k).  Thus  the  two  test  statistic  (3.2.5)  and  (3.2.6) 

should  be  equivalent.  Since 


A 

c 


|  a+b! 


P  2 
H  (l-r) 

i-1 


it  is  seen  that  the  two  tests  differ  only  in  the  constants  ■  »  2n  -  p  -  q 
and  2n.  Khatri's  result  has  bran  corrected,  so  it  will  be  used  in  (3.2.6) 
and  the  correction  will  be  adjusted  in  general  without  a  formal  proof. 

Thus  a  test  for  dimensionality  being  r  will  be  g^ven  as 


n.  -  (p-n  -  w-m  *i  ^  «-*]'  ;  'itp-n*-*  • 

tm  ataUd  before,  tbs  hffpamAi  tUt  tbs  ii— flwsllty  la  r  la 

equivalent  to  tha  hypothetic  of  -  v,  for  a  •  1,  2,  •••  c  k  , 

which  is  also  equivalent  to  tha  hypothesis  that  r (L)  -  a  •  p-r.  Now  tha 

matrix  L  and  the  vector  v  are  unspecified,  and  for  a  new  observation  £ 

from  ir  (a  =  1,  2,  •••  ,  k) ,  L£  would  provide  the  s  covariatee,  1#,(1)£, 

1*'(2)C,  •••  ,  £,*' (sK  .  New  these  s  variables  all  have  the  same  mean 

in  it  (a  ■  1,  2,  •••  ,  k)  and  are  no  good  as  discriminators.  Consider 

now  L  E  such  that  II L*'  *=  3  .  Now  these  r  variables  are  un  cor  related 
rxp*  1 

with  L£  and  these  can  be  used  as  discriminators.  Unfortunately  neither 
L  nor  is  known,  but  the  maximum  likelihood  estimator  of  L  can  be 
obtained . 

In  deriving  the  likelihood  ratio  procedure,  the 
Min  tr(HL*,)~1(LBL*') 

was  considered.  Now  the  maximum  likelihood  estimar-r  of  L  would  be  the 

-1 

matrix  L  that  does  minimize  the  above.  Recall  that  T  ^  II  and  the 
above  became 

Min  tr (TT* 1 ) -1 (TE 1/2BE 1/2T* 1 ) 

i 

1/2 

and  then  sot  Y  *■  (TT*')  T,  and  obtained 

Mr,  tr^of1''^:"172^*'] 

«r 

'Sxp 


subject  to  YY* ’  I  . 
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To  find  the  solution,  consider  the  last  s  eigenvectors  Y  satisfying 

(E“1/2BE“1/2  -  X  I)Y  «  0  (3.2.7) 

-pxl 

corresponding  to  the  s  smallest  eigenvalues  cf 

|e"1/2BE~1/2  -  X  l|  -  0  .  (3.2.8) 

The  condition  YY*'  «*  I  will  be  satisfied  since  the  eigenvalues  correspond¬ 
ing  to  distinct  roots  are  orthogonal  and  eigenvalues  corresponding  to 

.*  A 

repeated  roots  can  be  made  orthogonal.  We  can  take  T*'  to  be  Y*'  from  the 
s  orthogonal  eigenvectors  [Y. ,  Y  ,  • • •  ,  Y  ]  corresponding  to  the  s 

**x  ~3 

smallest  roots  and 

L**  *  =  E-1/2 y*« 

and  this  is  the  maximum  likelihood  estimator  of  L*'  . 

If  we  consider  the  largest  p-s  eigenvectors  corresponding  (3.2.7) 
and  (3.2.8),  these  will  provide  the  r  discriminant,  functions.  To  shew 
thro  LEL*'  -  j?  ,  consider 

-1 '2  /2 

(E  '  BE  *'  -  X  I)Y  o  0 

or  a 

(BE~1/2  -  X  E1/2  I'Y  <«  0 

or 

(B  -  X  E  I)E"1/2Y  «  0 

or 

(B  -  X  E  1)1  «=  0 

-1/2 

where  l  =  E  Y  .  Now  the  Y's  were  mutually  orthogonal,  i.a. , 
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Y#* (i)Y(j) 


i  *  j 
i  =»  j 


but  l  «■  £  "^Y  ,  thus  £*'(i)El(j)  ■  0,  which  implies  that  LEL’^  “  Jf  . 
Now  consider  a  new  observation  K  from  one  of  the  populations 

tt  (t  ■  1,  2,  •••  ,  k).  It  is  desired  to  place  this  observation  into 

a 

one  of  it  (a  «=  1,  2,  •  •  •  ,  k) .  The  procedure  for  this  discrimination 
a 

process  is  as  follows: 


Step  1:  Find  the  matrices  B  and  A  by  finding  the  CMANOVA  from 
the  saople  observations. 

Step  2:  Obtain  roots  cf 
2 


B  -  r  (A+B)  |=0  >  r2  >  •  •  *  > 


P 


Stop  3:  Determine  the  dimensionality  sequentially  by  finding  the 
smallest  val’e  of  r  such  that 

""l09  i-L  <l'ri>  ;  X2(P-rMq-t) 

is  insignificant. 

Step  4-  Determine  the  eigenvectors 


tB  -  r^  (A+B)  3 Mi)  -  0  i  «  1,  2, 
2  2 

corresponding  to  r^,  • • ••  ,  . 


Step  5:  Normalize  the  Mi)  by 
t*’  (i)E£(j) 
if  E  is  not  known  use  E  •»  A/n-q  . 


1  i-j 
0  i*j 
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Step  6:  Then  the  discriminant  functions  are 

1*'(1)§  -  Y1 
i*‘(2)§  -  Y2 


t*'  (r)£  -  Yr 

If  a  new  observation  is  observed  and  the  measured  charac¬ 
teristics  are  recorded  as  »  then 

•'^io 

*'  (rK0 

The  distance  (modulus)  of  this  point  from  the  estimated 
mean  of  ir^  is  the  quantity  that  will  determine  which 
population  that  5^  will  go  into,  i.e., 

Ju,50  *  l~*  (1) * ( 1) § <a> 1  * ' t £ * ' ( 1} ' Cll f (a) ] 

+  ...  +  [i*’(r)§0-^,(r)l(a)]*'t£*,(r)50-4*,(r)l(a)] 

“  (L^0  -  1^1(0)  -  L^ta)]  . 

Step  7 1  Determine  the  minimum  of 

(di4o  '  d2.§0  '  '  dMo I 

2 

and  assign  it,  if  mininura  is  d.  _  .  In  case  of  a  tie,  one 

j  3 'So 

could  randomize. 
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around  the  case  wore  the  dimensional:! ty  .is  one,  i.e. ,  only  one  discrim¬ 
inant  function  is  needed  and  with  tests  about  this  single  discriminant 
function.  In  this  section  a  test  of  the  goodness  of  fit  of  a  single 
hypothetical  function  will  be  developed.  This  hypothesis  will  consist 
of  two  parts,  (i)  whether  a  single  hypothetical  function  is  adequate, 
and  (ii)  whether  the  hypothetical  function  agrees  with  the  true  dis¬ 
criminant  function.  As  in  the  real  case,  the  hypothesis  will  he  stated 
as  the  collinearity  aspect  and  the  direction  aspect.  The  test  statistic 
will  be  partitioned  info  two  parts,  one  corresponding  to  the  collinear¬ 
ity  aspect  and  the  other  to  the  direction  aspect.  Independence  between 
these  factors  will  be  shown. 

The  CMANOVA,  as  presented  in  the  last  sections,  can  be  considered 

to  be  a  regression  of  the  complex  vector  E,  on  the  real  dummy  vector 

variable  Y  .  The  dummv  vector  variables,  Y, ,  Y_,  •••  ,  Y  ,  have  com- 

-1  -2  -q 

ponents  of  either  0  or  1  depending  on  which  population  the  5  comes  from, 
i.e.. 


Y'  =  [0  0  •••  0  11  •••  1  •••  0  0  •••  0] 

ni  na  \ 


and  the  regression  can  be  written  as 


E(£)  = 


if  f;  e  n 


1,  2, 


,  k 


Hk  + 


Hk>h  + 


+  (y  -  v.  >Y 

-q  -k  -q 


E(X) 


HkElN 


+  BY 


which  can  be  rewritten  as 
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where 


X 


f-ll  -12 


] 


and 


with 


V  ■  'V  h-  ■■ 


I  ] 
-q 


B  ■  [“-l  -  l!k'  H2  -  ttk 


With  this  in  mind,  the  CMANOVA,  may  be  written  as 


Source  of  Variation 

d.f. 

s •  8 •  find  s  *p* 

-l 

Regression (Between) 

q 

B  =  C  C  C 

xy  yy  yx 

-1 

Error (Within) 

n-q 

A  -  C  =  C  -CC  C 

xx  y  xx  xy  yy  yx 

TOTAL 

n 

A+B  *  c 

XX 

where  for  J  *  (I  -  ^  E  ) ,  then 
N  N  NN 


C  =  Y’JY 

yy 

c  =  XJY 
xy 


C  »  Y'JX*’ 
yx 

C  >=  XJX*’ 
xx 


Define  A^  as  the  matrix  consisting  of  the  first  i  rowe  and  columns 
of  A  and  likewise  for  E.  With  A^  =  1  and  »  1.  Then 
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A. 

i 


A  - 


|A| 

|  A+hl 


P 

n 

i-1 


'i-11 


VV 


which  can  be  written  in  terms  of  regression  as 


Ail 


A  = 


P 

n 

i=i 


K-i! 


p  1  "  \  (£,,  5,,  •••  ,  Y*> 

n  —  1  1  * 


’i-1'  - 


lVBil 


i=i  i  -  r; 


A.  .+B .  . 

1  l-l  l-l1 


VV  e2'  '  W 


where  R._  Ir.  _  _  ,  is  the  multiple  coherence  of  C.  with 

«i«l'  €2»  '  5i-l}  1 

^l'  ^2 '  *  *  *  '  ^i-1  *  This  fact  follows  from 


*'-l' 


1  -  R 


V  '  ei-l'  X'* 


lVBil 


i  -  r; 


A.  -+B. 


VV  V  *“  '  W 


which  is  evident  from  (2.4.4).  It  can  be  shewn  that 


W  V  *“  '  ^i-1'  X2  [n-  (gtl-i)l 


■  B(n-q+l-i,  q) 


1  -  R 


ti«i.  52,  •••  ,  5^)  X2  [n-  (q+l-i)  J  +  X2q 


Thus. 


1  -  Rt  , 

P  »  £9 > 

n  — — — — 


'  W  D 


i-i  1  -  r; 


e2. 


'  W 


n  u. 
i-1 


(3.3.1) 
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whero  ui  ^  B (n-q+l-i ,  q) .  Tnis  is  the  result  obtained  by  Khatri  (1965), 

who  has  shown  that  the  (i  «  1,  2,  •••  ,  p)  form  an  independent  set 

of  beta  variables.  Another  definition  of  A  is  given  by  equation  (3.3.1), 

c 

p 

i.e.,  if  a  s  .utj.dt.ic  A  is  distributed  as  II  u.  jihere  u.  (i  *»  1,  2,“*,  d) 

C  i  i  - 

are  independently  distributed  as  B (n-q+l-i,  q) ,  then  A  is  said  to  have 

c 

a  complex  Wilks’  A  distribution  with  parameters  n,  p,  and  q,  denoted  by 
A(n,  p,  q) .  Thus 


A 

c 


n  u. 
i=l 


«=  u  A(n-l,  p-1,  q) 

*  u.,  u2  cA(n-'?,  p-2,  q)  . 


The  hypothesis  of  equal  means  can  be  restated  in  terms  of  the  re¬ 
gression  of  (  on  V  as  one  of  no  association  between  £  and  Y  .  This 
hypothesis  can  then  be  broken  into  the  following  sub-hypotheses 


Hi!  5^  has  no  association  with  Y 

H2:  ^2  has  110  association  with  Y 

H  :  £  (eliminating  £. ,  ••*,(;  ,)  has  no  association  with  Y  . 

p  p  12  p-1  - 

If  the  overall  hypothesis  of  no  association  is  rejected,  then  at  least 
one  of  the  (i  »  1,  2,  ,  p)  is*  rejected. 
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Observe  that  the  criterion  for  the  sub--'-*  pothasis  is  given  by 


I  A.+B. 

1  1  l 

I  A.  ,+B.  ,1 
1  i-I  i-l* 


1  -  R, 


V5i'  ,  5^.  v’) 


i  -  r; 


^2'  ***  '  ^i-1^ 


It  is  desired  to  develop  a  goodness  of  fit  test  for  a  hypothetical 

discriminant  function.  Denote  this  function  by  £*’£  «  Z^  and  consider  a 

non-singular  trans formation  of  £  -*•  Z  by  LC  =  Z  where  the  first  row  of 

Tj  is  £*•  .  It  is  now  desired  to  test  that  Z^  is  the  reason  that  the 

dimensionality  is  not  zero.  Considering  the  population  means  of  the 

transformed  variables,  Z,  is  the  one  that  differs  from  Z.,  Z-,  **•  ,  Z 

1  2  3  p 

If  this  is  true,  then  after  eliminating  Z^  the  set  Z^»  ,  Z^  will  have 

no  association  with  Y  . 

Thus  the  test  statistic  is  given  by 


~  eA  P  - 
A  “  r~  “  n  u. 

=  U1  i-2  1 


where  cA  =  |  LAL* ’  |  /( L(A+B) L* 1 |  ,  i.e.,  the  transformed  CMANOVA.  But 
observe 


A  =  — 


L(A+B)L*’  A+B 


thus  the  test  statistic  is  given  by 


c  -  £*’ (A)£/£*’ (A+B)£ 


(3.3.2) 
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but  from 


A 

c 


P  2 
n  (l-rp 

i=»l 


so  the  test  statistic  becomes 


p  2 

n  (1-rH 

i=l  1 


u 


1 


P  2 

n  (l-rf ) 

(A+B)£  J  j 

1=2  1 

i 

(3.3-3) 


Cbserve  that  the  second  factor  is  the  tes*-  statistic  for  the  dimension¬ 
ality  of  one.  Th2  first  factor  tests  that  the  discriminant  function  is 

given  by  £*'£  .  Cbserve  further  that  the  two  factors  are  not  indepen- 

2 

dent,  since  the  r7  (i  =  1,  2,  ,  p)  are  not  independent.  Prom 

2 

(3.3.3)  an  approximate  X  test  can  be  constructed. 

Now  a  factorization  of  A  into  two  independent  factors  that  measure 

c 

the  direction  aspect  and  the  partial  collinearity  aspect  can  be  achieved 
or  _A  can  be  the  factor  into  two  independent  factors  measuring  the 
collinearity  and  'partial'  direction  aspect.  The  use  of  the  word  'partial' 
will  be  discussed  later.  In  the  real  and  general  case  this  factorization 
wris  developed  by  Bartlett  (1951),  who  gave  a  geometrical  argument. 
Kshirsagar  (1970)  gave  an  analytical  argument  for  the  same  factorization. 
The  wort  presented  here  in  the  complex  case  is  a  corabination  of  the  two, 
with  extensive  use  of  Kshirsagar's  wort,  ihe  factorization  of  cA  into 
the  direction  and  'partial'  collinearity  aspects  is  presented.  The 
alternate  factorization  can  be  achieved  in  a  similar  manner. 

If  the  hypothesis  fc^at  £*'£  is  the  only  discriminant  function  is 
true,  then  £*'(;  is  the  oniy  variable  that  has  a  different  maan  among  the 
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k  populations.  Any  other  linear  combination  of  uncorrelated  with 
has  the  same  mean  in  all  the  k  groups.  Let  L  be  a  p  *  p  non¬ 
singular  matrix  with  l*'  as  its  first  rcw  and  let  L  be  such  that 
LEL*'  =  I  .  (There  is  no  loss  of  generality  for  thic  assumption.) 
Transfona  from  5  to  Z,  where  Z'  =  [Z,,  Z0,  *•*  ,  Z  ]  by 

“  “  ~  JL  Z  P 

Z  =  L  §  . 

Then  Z^  is  the  hypothetical  discriminant  function  and  •••  ,  Z^  are 
all  uncorrelated  with  Z  »  as  LlL*'  =  I  .  Under  the  hypothesis, 

Z  ,  •••  ,  7.  all  have  the  same  mean  in  the  k  groups.  The  test  for  the 

/.  p 

hypothesis  has  been  developed  and  was  shown  to  be  on  (3.3.1) 

A  A 

A  =  - - - .  (3.3.4) 

C  (A+B) 9.  c  l 


argument.  Let  D  be  the  between  groups  matrix  for  the  transformed 


variables  Z,  then 


D  =  LBL*'  **  [d^] 


and  the  within  groups  matrix  is  W,  given  by 


W  °  LAL*'  -  [w^]  . 

As  in  the  real  case,  it  is  obvious  that  W  has  the  Wishart  density  give*) 

by  GJp(wjn-q|l)  since  L£L* 1  I  and  likewise  D  *>  LBL* '  is  distributed  as 

a  non-central  complex  Wishart,  with  the  non-centrality  parameter  due  only 

to  the  difference  in  means  of  Z^  alone  in  the  k  groups  and  thus  affects 

2 

b^  only,  i.e.,  b  is  a  non-central  x  with  2q  degrees  of  freedom.  So 
the  density  of  D  may  be  written  as 


CWo(:j!g|l)§  (b]L1) 


where  §  (b^)  is  the  contribution  due  to  the  non-centrality.  An  explicit 
expression  for  i (b^7)  will  not  be  necessary  in  this  development. 

New  W  'v  CW^(w|n-q|l)  where  W  -  [w^].  From  this  we  have 


V  V#  V 

il  -  il  11 

— -  (i  -  2,  •••  ,  p)  and  w  ■  w - (i,j  -  2, 

11  ^7  11  ij  "u 


are  independently  distributed.  The  -  are  c'T®lex  standard  normal 

and  independent  variables,  while  the  matrix  W  ”  [w^)  is  a  cca^>lex 
Wishart  of  order  o~l  .  This  follows,  as  in  the  real  casa,  from  the  fact 


that  if  S  ^  CW  (S  n  I)  .and 
P 


[S21  S22j 

then  S^2.  ^  ”  S.^  ”  S21S11S12  *"a  or<*®r  P-^  snd  distributed  as  a  complex 
Wishart  and  is  independent  of  and  S  •  So  applying  this  result, 

we  have 

W  ^  CW  (w| n-q-lj I)  . 

P  ^ 

In  a  similar  matter,  r  can  be  split  up  as 


il 

Vdn 


(i  =  2, 


d  d 

,  p)  and  d  =  d - --- J~  (i,j  =  2, 

3  J  11 


/  P) 


1/2 

and  are  independently  distributed.  The  d^/(d^)  are  CN^(0,  1)  and 
are  independent  and  D  =  [d. . J  is  distributed  as  CW  , (5|c-l|l)  .  All  of 

p-1 

these  elements  are  independent  of  W  and  hence  W  . 


Consider  the  variable 


0  .  = 
y 


-  /  dllWH  ( J-l  _  ^i 1 \ 

\dll+Wll  ^dll  W11 J 


to T  i 


2, 


»  P 


which  i3  a  CN^(0,  1)  ^andom  variable.  To  see  this  consider  the  conditional 
density  of  for  d^  and  w  fixed.  This  is  CN^(0,  1)  but  does  not 
depend  on  w  or  b  ,  henoe  it  is  jnconditional  distrit  itions.  Let 

0  denote  the  colum  vectors  0^  (i  °  3,  ,  d)  . 

Thus  the  following  results  -.re  obtained 


W  v  CW  ,  (W J  n-q-1  j  T.) 


D  ^  CW  'D  q-ll I) 

p-1 
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and  all  are  independent,  observe  that  90*'  is  distributed  as  a  pseudo- 
complex  Wishart  with  one  degree  of  freedom. 

Recall  that  vviiks:  ^  with  parameters  n,  p,  q  was  defined  as 


c 


1a| 

|a+b! 


A (n ,  p,  q) 


where  A  n,  cw^(A)n-qjZ)  independent  of  B  ^  (B |  q |  E ) .  The  parameter  n 
is  the  d.f.  of  A  plus  the  degrees  of  freedom  of  B;  p  is  the  order  of 
A+B ;  q  is  the  degrees  of  freedom  of  B.  Using  this  terminology,  then 


W 


w+9  e*' 


^  A (n-q-1+1,  p-1,  1)  =  A_ 

C  o  5 


w+e  e*' 


w+m-e  e*' 


n.  A(n-1,  p-1,  q-1)  =  A. 
c  c  4 


W 


|  w+d} 


n-  A(n-1,  p-1,  q-1)  =  A 
I  C  C  3 


W+D 


W+D+6  0*' 


^  A(n-1,  p  -1,  1)  *  A„ 

c  c  2. 


Furthermore,  independence  between  A_  and  A,  Lz  obtained  or  independence 

c  j.  c  3 

between  A„  and  is  obtained.  The  same  argumsnt  for  the  real  case  is 
c  4  c  5 

valid  in  the  complex  situation,  i.e.,  if  A  a,  CW  (a|  f  [  I)  and  5.  ^  CN  (0, 

r  ^  Jr 

(i  »  1,  2,  •  ■  •  ,  n) ,  then 


I  a! 


Ia+  y  s.e*' 
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in  independently  distributed  of  A  +  £  £.£**  • 

i=l 

New,  it  must  be  shown  that  ^A  is  indeed  given  by  the  product 
d^2  *  c^3  ’  then  it  will  be  shown  that  ^A^  is  the  direction  factor 

and  ^  is  the  'partial1  colline&rity  factor*  The  alternate  factori¬ 
zation,  cA  “  ,  can  also  be  shown.  Consider 


M 


cA2  ‘  cA3 


w+eh-6  e*‘ 


w1 


w 


11 


w+D+e  e4-' 


and  observe,  that 


jff  +  D  +  8  8*'  j  »  r — j -  |w+d| 

11  11 


(3.3.5) 


then  the  above  can  be  written  as 

J«l 

W11 


Jw«| 

bll+wll 


(A+B)  Jl]  |liAL* ' 


(£*'A£)  |l(A+B)L*' 


Since  w^  =•  l*'P&  aad  (w^  +  b^)  **  i*'  (A+B)J, 
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i  .1 
1*1 


|a+b 


l* 'Mi 
A*' (A+B) A 


A 


and  it  is  seen  that  (3.3.4)  is  verified. 

Now  it  must  be  shown  that  A„  is  the  direction  factor  and  A_  is 

C  »  C  3 

the  'partial'  collinearity  factor.  Consider  A.  first.  To  see  this, 

C 

define  a  matrix  H  by 


W11 

!  W12  *  *  *  wiP 

W21 

wij  +  *ij 
(if j  -  2,°”,p) 

w  , 
Pi 

and  a  vector  k  by 


then  observe  that 


D  +  W  -  H  +  *  k*' 


and 


H|  -  wn|W  +  D 


(3.3.6) 


HO 


a  id  from  (3.3.5),  that 


|H  +  k  k*'(  =  (v;n+bn)|W  +  n  + 


6  0*' 


(3.3.7) 


Then  cA2  may  be  written  in  terms  of  (3.3.6)  and  (3.3.7)  as 


.  .  <bii+vn) 

c  2  w 


l'H| 


11  |H  +  k  k* ' | 


then  the  above  may  be  written 


l0ll+”u' 


(w 


,  , ;  -1 
U  1  +  k*'H  “k 


Since  |H  t  k  k*‘j  .  (h1  •  jl  +  H^k  k*'|  »  |H|  •  (1  +  k*.H"\) 
can  be  further  simplified  no 


and  A„ 
c  s. 


•U+'V  .. 

—  *  '1  "  k*'(H  +  k  k* *) 


tliis  can  be  seen  by  using  the  fact  that 


(X  +  PQ)  1  =  1  ~  P(I  +  QP) _1Q  . 


Noting  tnat 


!S*'  -  %*'  L  1DL*'  '  (l~^dl*  '”*)  8,] 


then  nay  Le  written  es 


.  (dii+v,i:)  |  ^*'L  ]DL*'  [L~ 1  (W^D)L*'"1]~1l“1DL*,“1 


; —  U - 


i! 
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and  observing  that 


bH+Wii  £*’  IL  1(W+D)L*'  1U 
W11  l*'  (L_1WL*,-1)£ 


then 


l * ' L-1DL*  * [L_1 (W+D) L* ' ~1]_1l"1DL* * 
(L_1DL*  *  ~ 1) 

I*’  [l“1(EH-W)L*,“1]S. 


and  using  the  fact  that  D  =  LBL*'  and  W  =  1AL*',  then 

1  -  Z*,B(A+B)"1BX.A*'B«, 

^  - 

&*' M/l*'  (A*B)  l 


Then  A  car  be  represented  as 
C  j 


cA3 


A 

_ |a+b| _ 

1  -  £*’B (A+B) "1Bf/R*'Bi 


Returning  to  the  CMAI30VA  for  the  transformed  variable  Z  ■  L£,  it 
can  be  expressed  as 


Source  of  Variation 

d.f. 

8.8*  dnd  8 •  p • 

-l 

Regression (Between) 

q 

LBL*'  *=  D  =  C  C  C 

zy  yy  y* 

Error (Within) 

n-q 

LAL*'  -  W  -  C 

zz*v 

TOTAL 

n 

L(A+B)L*'  -  C 

zz 
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The  A  can  be  exoressed  as 
c 


_5£ir 

C  I 
zz 


I c  1 

'  xx*  y 1 


XX' 


then  using  the  fact  that 


c 

c 

xx 

xy 

C 

C 

yx 

yy 

-  c 


c  c-1c  =  c  I  •  c  -  c  c"Ac 

uv  w  vrr  trtr 1  w  vir  m*  « 


-1. 


xx  yy  yx  xx  xy 


yy  1  xx  xy  yy  yx 


it  is  seen  that  another  expression  f  _r  ^A  is  give;;  by 


|C  | 

A  =  --Hr:*!. 


|c  J 

yy 


Now  (3.3.9)  is  a  measure  of  the  "lac;:  of  relationship"  between  Li 

-nd  (3.3.10)  is  a  measure  of  the  "lack  of  relationship"  between  1 

Multiplying  and  dividing  (3.3.10)  by  | Cyy.tl  >  the  residual  of  Y 

the  effect  of  2.  =  t  removed,  A  can  be  written  as 
1  c 
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Consider  the  first  factor,  using  (3.3.9),  this  can  be  written  as 
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Hence  another  expression  for  ^A  is  given  by 
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which  is  the  test  statistic  when  1*'£  ■  t  is  eliminated,  thus  measuring 
the  association  between  Y  and  Z,  eliminating  t  .  Now  cA  must  be  factored 
into  cA2  and  .  This  will  be  achieved  by  factoring  the  numerator  and 
denominator  of  (3-3.11). 

Let  §  denote  the  sample  projection  of  t  on  the  Y  space.  The  re¬ 
gression  of  5  on  Y  is  C  C  ,  thus  the  regression  o*  t  ■  £*'£  on  Y 

is  i*'C  C~*Y  .  The  sancle  projection  §  is  given  by  §  «=  fc*'C  C  or 
xy  yy-  xy  yy- 

by  §  -  m*'Y  where  m*'  =  Z*'C  C  1  .  Let  y  denote  the  remaining  q-1  linear 

-  xy  yy  L 

functions  of  the  Y's  .  Then  |c  [  can  be  factored  into  ! C-. .  | • | C  6  | 

yy*  2  3  9  Z  TT  “ " 

This  follows  from  (3.3.9).  Likewise  ic  J  may  be  factored  into 

yy  t 

|C  J  =1 C, ,  _| - 1  C  c ^ I  and  A  may  be  expressed  as 
1  yyt1  1  §§*t'  1  c 
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Now  it  will  be  shown  that 
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similar  matter,  A,  follows.  Thus  we  have 
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Now  J'  _  is  a  measure  of  the  association  of  § ,  the  sample  projection 

on  the  Y  space,  with  the  £  space  with  the  effect  of  the  discriminant 

function,  t,  removed.  Now  §  -  m*'Y,  where  m*'  ■  i*'C  C  *  .  Observe 

-  xy  yy 

that  m  is  determined  by  l,  the  given  direction,  thus  one  would  expect  no 
association  to  be  present  if  i*'5  »  t  is  eliminated.  Since  Y  •  m*'Y  is 
considered  first  and  then  the  ^ 's  are  considered  after  eliminating  ra*’Y, 

A.  is  called  the  'partial'  collinearity  facror.  Considering  the  y's 

C  4 

first  and  then  considering  i  after  the  elimination  of  y  will  give  the 
alternate  factorization,  i.e-, 


A . 
c  4 


Ac 
c  5 


where  A,  is  the  collinearity  factor  and  A_  is  the  'partial'  direction 
c  4  CD 

factor. 

A,  is  the  collinearity  factor  since  it  depends  on  the  number,  but 
C  3 

not  the  direction,  of  the  remaining  q-1  variables. 

In  summary,  we  have  the  test  for  the  hypothesis  that  i*'£  is  the 
only  linear  discriminant  function  needed  to  discriminate  among  the 
k  *  q+1  complex  p-variate  normal  population  based  on  the  statistic 
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where  *  A(n-1,  p-1,  1)  independently  of  cA3  ^  A(n-1,  p-1,  q-1)  >  where 
A,  is  the  direction  factor  and  A  is  the  'partial'  collinearity  factor. 

C  m  C  w 


CHAPTER  IV 


SUMMARY 

Through  the  work  of  Wooding  (1956)  and  then  Goodman  (1963)  the 
foundation  for  complex  multivariate  analysis  was  established.  This  work 
was  extended  by  the  major  contribution  of  Khatri  (1965)  who  developed 
much  of  the  basic  theory  needed  for  the  analysis  of  complex  multivariate 
normal  random  variables.  It  was  fchic  author's  intentions  cf  further 
extending  this  development  by  examining  the  complex  analogue  of  Wilks' 
statistic  as  used  in  the  multivariate  analysis  of  variance  procedure  and 
a3  used  in  the  discrimination  problem. 

Chapter  II  of  this  paper  contains  the  basic  theorems  necessary  for 
the  development  of  the  analogue  of  Wilks'  statistic  as  developed  in 
Chapter  III.  Also  included  are  results  pertaining  to  the  decomposition 
of  a  complex  Wishart  matrix.  This  deconposition  was  fundamental  in 
establishing  results  about  the  coherence  between  two  complex  random 
variables  and  the  echo  nco  between  two  conplex  random  vectors.  One 
possible  extension  of  this  area  is  the  investigation  of  the  complex  t- 
distribution  which  occurs  as  the  distribution  of  the  sample  estimate  when 
the  true  coherence  between  two  complex  normal  variates  is  zero.  This  is 
in  direct  analogy  with  the  real  case. 

The  development  pertain' .ig  to  Wilks'  statistic  was  presented  in 
Chapter  III.  In  Section  1  of  Chapter  III,  a  test  for  the  equality  of 
means  of  K  conplex  p-/ariate  normal  populations  was  developed.  is  was 
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accomplished  by  use  of  the  likelihood  ratio  criterion.  A  possible  ex¬ 
tension  of  this  section  would  be  to  examine  the  properties  of  this  test 
procedure  when  applied  to  complex  random  variables.  Some  work  has  been 
done  in  this  area  t/  Girl  (1865)  .  In  Section  2,  a  test  for  dimensionality 
of  the  mean  space  was  developed  and  it  was  shown  that  this  was  equivalent 
to  knowing  the  number  of  linear  discriminant  functions  needed  to  discrimi¬ 
nate  among  the  k -populations .  A  needed  extension  is  to  determine  the 
exact  correction  factor  that  should  be  used  for  the  chi-square  test.  In 
Section  3,  a  test  for  the  goodness  of  fit  of  a  single  hypothetical  dis¬ 
criminant  function  was  developed.  The  test  statistic  was  factored  into 
two  independent  par*  3,  one  for  the  direction  of  the  hypothetical  discrimi¬ 
nant  function  and  the  other  to  test  the  'partial'  collinearity  of  the  means. 

<»  possible  extension  would  be  the  goodness  of  fit  of  more  than  one  dis¬ 
criminant  function.  Another  extension  would  be  the  goodness  of  fit  of  a 
single  discriminant  function  from  the  vector  space  of  dummy  variables. 

For  the  p-variate  complex  normal  as  defined  by  Wooding  and  Goodman, 

Khatri  (1964)  has  noted  "that  one  can  handle  all  the  classical  problems 
of  point  estimation  and  testing  hypotheses  concerning  the  parameters  of 
complex  multivariate  normal  populations  much  as  one  handles  those  for 
multivariate  populations  in  real  variables."  This  is  true  for  the  prob¬ 
lems  that  have  been  considered  in  this  paper.  As  a  matter  of  fact,  all 

the  work  so  far  with  the  p-v?riate  complex  normal  as  defined  by  Wooding  | 

i 

and  Goodman  has  been  done  by  paralleling  the  real  case.  Even  so,  this  i 

type  of  development  is  important  because  complex  multivariate  analysis  , 

has  possible  use  and  applications  in  stochastic  processes  and  spectral 
analysis  of  multivariate  time  series  and  point  processes.  N.  R.  Goodman 
and  M.  R.  Dibman  (1968)  have  considered  the  t'r.  ory  of  time- varying 

i 
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spectral  analysis  and  complex  Wishart  matrix  proce'  ses  and  the  assumption 
of  etationarity  and  normality.  They  have  considered  complex  Gaussian 
processes,  Wishart  processes  and  time-varying  spectral  estimates  along 
with  distributional  results  associated  with  them.  This  author  feels  that 
the  work  reported  in  this  dissertation  will  be  helpful,  indirectly  at 
least,  in  such  investigations. 

David  Brillinger  (1968)  has  considered  the  canonical  analysis  of 
stationary  time  series.  The  distributional  results  about  canonical 
correlation  of  complex  normal  vec  ors  are  likely  to  be  useful  in  chis 
area. 
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